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1.  Preface 


The  purpose  of  this  document  is  to  provide  an  overview  of  IRL'S-II  and  assist  developers  interested  in 
adapting  IRUS-II  to  new  application  domains  Chapter  2  provides  a  general  introduction  and  overview  Chapter  3 
describes  the  use  of  domain  models  in  IRUS-II:  it  provides  suggestions  for  the  development  of  new  domain  models 
and  a  description  of  the  syntax  for  the  development  of  domain  models  with  N1KL.  Chapter  4  descnbes  the  Semantic 
Interpretation  Rules  ot  IRUS-II.  Chapter  5  presents  the  categones  and  features  of  the  IRUS-II  dictionary  .  Chapter  6 
descnbes  the  World  Model  Language  (WML),  the  logical  representation  language  of  IRUS-II.  Chapter  7  descnbes 
using  a  new  knowledge  acquisition  tool  that  speeds  development  of  syntactic  and  semantic  information  for  the 
dictionary.  Chapter  8  describes  the  process  of  translating  a  WML  expression  into  a  form  that  one  or  more 
underlying  systems  can  execute.  Finally.  Appendix  A  provides  instructions  for  loading  the  IRUS-II  software. 

This  document  supercedes  an  earlier  version.  BBN'  Report  6989. 
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UI1S  Systems  and  I  et  hm  duties  <  urpor  a  turn 


2.  Introduction 

2.1  BBN  Natural  l  anuna^c  wstoms  -  (hcrxicu 

2.1.1  Introduction  and  Hack "round 

BBN  hxs  developed  an  evolutionary  sequence  ot  siate-ot-tlie-art  natural  language  interlaces  xs  part  ol 
DARPAs  Strategic  Computing  Program  IRIS  S .  '<)[  the  first  in  that  sequence,  was  tlK-  result  <>('  sears  ot 
DARPA -  funds d  research  IRI  S  uxs  installed  an. I  demonstrated  -it  CISC  PACKET  in  14Xn  xs  pan  •>!  ilk  Meet 
Command  Center  Battle  Management  Progi.im  it  w  .is  integrated  with  the  F’RtSH  environment  at  CINCPACTLT 
xs  a  natural  language  interlace  to  Navy  dat.iha-e-  U>B  and  to  a  mapping  and  report  generation  s\  stem  < < >Si  it* ■ 
The  initial  version  cd  the  second  evolutionary.  tut  u al  language  system.  1RI  S-II  was  demon'll  tied  in  May  Ids'  a- 
p.irt  of  the  Janus  sv'tem  The  PA"  version  -  t  I  inu-  integrated  understanding  capabilities  developed  by  BBS  vv uh  a 
generation  system  develojv  I  h>  IS! 

The  current  version  of  the  Janus  system  w  xs  first  dcnmnsti.ited  m  March  l'*xx  xs  a  seamless  osi.-n 
integrating  I  .men  age  menus,  maps  tables  and  pointing  Janus  now  ir.coqvrate  -  the  IRl  S-II  under- land  ■■  g  v  >t.t, 
and  the  SPOKESMAN  natural  language  generation  system  II  \  1 5 1 .  used  tor  paraphrasing  the  natural  language 
input,  generating  English  responses  to  queries  and  clarifying  ambiguous  or  partially  understood  input- 

A  tc'.Ccted  bibliography  ot  papers  and  reports  ,.n  BBN's  natural  language  pri'iects  is  provided  at  tJie  end 
this  document 

2.1.2  IRl  S-II  Components 

In  IRl  S- 1!  1  xs  in  I R  l  Si.  the  natural  lane  u  a  re  capability  is  distinguished  hoih  I  tom  the  soiuepiu  al  v  ie  w  ot  >he 
domain  and  from  the  details  of  the  particular  expen  system  or  application  program  heme  used 

2. 1.2. 1  I  rout  I  nd  ( ompoiunls  of  IK l  S-II 

Tlie  ni.uor  domain-independent  trout  end  modules  of  the  IRl  S-II  system  include  a  parser  and  associated 
grammar  a  semantic  interpreter,  and  a  discourse  subsystem  tor  resolving  anaphoia  and  elhpst.  These  module- 
function  to  parse  an  English  query  or  command  into  meaningful  phrases  and  structures,  which  a  a*  translated  into  a 
formal  semantic  representation  which  represents  the  meaning  of  English  sentences  in  a  higher  order  intensional  logic 
called  World  Model  Language  tWMLi  ([II!  and  chapter  hi  WML  is  essentially  independent  of  the  target  system's 
command  language  and  its  constants  are  concepts  from  a  domain  model  'chapter  3t  The  domain  model  contains 
information  about  concept-  and  relations  between  concepts  in  a  specific  domain  IRI  S-U  currently  uses  the 
MKL  [16]  knowledge  rep  re  -entaiion  'omul  ism  to  represent  the  domain  model 
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Addition:!!  ont  end  knowledge  bases  contribute  tn  the  syntactic  semantic  and  discourse  processing  o| 
English  re  cr  A  lexicon  i chapter  5i  contains  the  mtomiution  about  word  and  phrase  substitutes  par's  . >t  speech 
and  s'.itactis.  semantic,  and  morphological  teatuies  needed  tor  parsing  A  set  ot  semantic  interpr eiati  u  rule- 
Unapter  4i  specif)  mappings  belueen  English  constructs  and  concepts  and  relations  in  the  application  dotnaiti 
These  are  used  to  determine  whether  the  incremental  syntactic  structures  produced  b>  the  parser  are  micrprviabic 
structures  and  to  specif)  their  imernretalion  These  rules  lor  generating  WML  depend  on  inhuman  at  contame  I  iri 
tlx.*  domain  model  The  dtscouise  component  uses  a  set  of  rules  which  generate  discourse  entities  based  or.  die 
structure  ot  the  WML,  and  a  separate  set  d  co-reference  constraint  rules  for  lntra-sentential  anaphora  resolution 

2. 1.2.2  Hack  Krtd  Components  of  IRI  S-II 

The  t.nh  of  the  bad.  end  components  ot  IRI  S  11  is  to  take  a  M  ML  c  xpression  repre  -enting  the  meat  :i g  o'  .> 
quer.  and  compute  the  correct  command  or  set  ot  commands  to  one  or  more  underl)  me  s)  stems,  obtaining  (he  result 
requested  b)  the  user  This  problem  is  decompo.cd  into  the  following  steps 

•  Simphb.  the  WML  expression  where  possible 

•  Iden'it)  the  under!) mg  s\ stem1  -  which  iiu>  contribute  to  she  satisfaction  ot  the  user's  request,  m.iktr  z 
use  ot  description*  of  underlung  s\ stem  capabilities 

•  Fomtiilate  a  sequence  of  calls  to  the  underlying  svstenvsc  attempting  to  mitumi.’e  cost  eft  >r  m 
instances  where  system  capabilities  twerlap 

•  Execute  the  call  sequence 

•  Generate  a  response  which  will  be  helpful  ui d  mh  nn.iinc  tor  the  user 

Chapter  S  explains  these  steps  in  more  detail  The  major  modules  in  the  backend  correspond  to  these  steps 
and  include  a  module  which  applies  tr.uislaiion  rules  io  an  expression  and  simplifies  the  result,  a  module  that 
dediues  which  underlying  swetie  s>  should  be  utilized  a  module  that  constructs  the  appropriate  calling  sequeiu, 
and  a  module  that  executes  the  cede  and  makes  sense  of  the  results 

A  It  bon  el:  the  constant  >  in  Mi  are  .  i  'inept  -  tr  -rr,  the  li  main  model  the  constants  m  the  executable  code  uv 
determined  b\  the  structure  ot  the  site  s  underl;.  me  s\  stents  This  means  that  the  Lnghsh-to.WMi  and 
simplification  translation.  ,tr  e  domain  dependent  hut  underl)  tug  system  independent,  while  the  genet  at  i  ■  >n  ot  an 
executable  sequence  is  both  domain-  dependent  and  underlying-system  dependent  Therefore,  were  a  ylataba.se 
administrator  to  reorganize  the  database,  the  WML  and  its  corresponding  simplifications  would  he  unutieded  hut  it 
would  he'  necessar)  to  update  the  appropriate  underlying  system  description 
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2.1.3  The  1  Aolutimuirv  Step  from  IRl  S  to  IRl  S-II 

The  main  differences  between  IR1  S  arui  IR1  S-tt  eari  be  characren/ed  as 

1  Using  intenMi'nal  logic  <II.ito  represent  natural  language  expressions 

2  A  sophisticated  back  end  component  that  maps  intettsional  expressions  mio  ttieir  extensions  in  the 
underlying  systems.  incorporating  a  clear  methodology  wind!  uses  declarative  knowledge  abom 
underlying  system  capabilities. 

3.  A  new  discourse  component  which  geneiates  discourse  entities  (  "things  one  can  refer  to  d  from  the 
meaning  representation  of  a  sentence  .uid  handles  references  intrinluced  by  pointing  gestures  i n  a 
manner  consistent  w  nti  the  treatment  of  linguistic  ti  e  pronouru  relerences. 

4  The  ncorporation  of  K.NACQ  |33].  with  its  general  rules  for  treating  attributes  i chapter  " i 

1. 1.3. 1  lnUnsion.il  I  oific 

One  of  the  difficulties  in  developing  a  system  of  logical  representation  for  natural  language  utterances  o  that 
traditional  first  order  logic  re  g  the  MRI  representation  of  IRL  S  i  is  more  limited  in  Us  expressive  range  than 
natural  language  is.  Intensnm.il  iogic  re  g  .  the  WNIL  representation  of  [RUS-IIi  represents  one  attempt  to  broaden 
the  expressive  range  ot  logic  that  it  covers  a  broader  scope  of  natural  language  Intension.!)  logic  can  represent 
expressions  w  hose  v  alue  i  extension  >  \  .me'  depending  on  time  t  a  charactenstic  ty  pical  of  rea!  w  orld  applic  at  ions ,ts 
in  "What  were  Frederick's  last  3  positions  *  It  can  also  represent  propositions  which  are  either  true  in  some 
possible  world,  or  necessanly  true  in  all  possible  worlds,  which  allows  for  representing  the  semantics  ,-i 
hypothetical,  or  "what-if  situations,  as  in  Suppose  Fredenck  were  C3' .  as  well  as  modality  teg.  powihhf. 
necessity  i.  as  in  "Can  tlie  unit  amv  e  here  w  itfmi  2< >  hours  ’ 

WM1  now  allows  us  to  represent  predicates  or  operators  that  take  intenstonal  arguments,  such  as  CllAViL 
or  REQULS1  We  now  represent  the  contribution  that  tense  provides  to  tlie  detennination  of  an  evaluation  time 
index,  by  using  operators  such  as  PAST  and  FUTURE  A  umfomi  treatment  of  plurals  was  also  established  so  that 
singular  nouns  generate  the  standard  interpretation,  and  plurals  generate  a  POWER  operator  acting  on  a  so', 
allowing  for  collective  as  well  -  distributive  readings  |2i.  2 2 ] 

In  order  to  generate  WML  bonus,  the  semantic  component  of  1RUS  w.t>  modified  Tlie  clear  interlace 
between  separate  syntactic  and  semantic  components  allowed  for  changes  to  the  semantic  interpreter  without 
perturbing  other  modules  A  new  operator.  FULL-INTERP  is  now  allowed  in  IRules  to  indicate  that  a  constituem 
should  be  taken  intensionally .  While  some  of  the  default  IRules.  one  dealing  with  tense,  for  example,  had  to  be 
modified,  the  IRules  and  lexical  entries  that  were  domain  dependent  remained  exactly  the  same,  reminding  us  ot  tU 
advantage  of  encoding  knowledge  decl.iralively 

WML  is  a  strongly  typed  logical  language  The  formal  syntax  and  set-theoretic  semantics  of  WMF  a  a* 
described  in  [  1 1  ].  Further  discussion  of  WMF  is  in  Chapter  h. 

The  World  Model  Language  (WML>  representation  of  a  query  represents  the  most  abstract  stage  in  the 
processing  of  a  query  by  IRl  S-II  While  the  generation  ot  WML  is  dependent  on  the  svntax  and  semantics  ot  the 
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natural  lancuaee  c]uery .  the  generation  ot  a  query  in  the  language  ot  the  underlying  system  trom  the  WM! 
representation  ts  dependent  on  tlie  inteitace  to  the  underlun  system,  the  lunctit  nalny  ot  that  system,  and  the' 
structure  ot'  its  database  or  know  hedge  base. 

The  WML  express)  n  produced  b\  the  hunt  end  is  the  input  to  the  ha  end  >10  he  translated  into  executable 
code  1  and  the  input  to  the  Paraphra>er  > to  paraphrase  the  system  interpretation  ot  the  input  > 

2. 1.3.2  A  Belter  Handling  t>r  Mappirv  10  Multiple  l  nderlyin^  Systems 

Tlie  previous  veneration  ot  1KLS  (eg.  .ls  used  in  the  IRLS.OSGP  demonstration  sy  stem  1  made  use  ot  a 
limited  number  of  underlying  systems,  choosu-g  amongst  them  vi  1  a  small  set  of  key  words  embedded  in  the 
representation  <  f  a  querv  The  !Rl  S-U  natural  language  interlace  makes  the  mtcgi.ition  ot  multiple  underlying 
systems  transparent  not  onlv  tc  die  user  rets  in  the  I K. i  S  systenn  but  to  finish  ot  die  natural  language  system  as  well 
The  multiple  underh  ing  svstems  component  tal  es  an  application- 0  stem  independent  representation  ot  the  utierure.' 
(the  WML  expression  in  a  simplified  (omit  and  determines  which  subexpressions  tal!  v  dm.  the  capabilities  >1 
which  application  systems 

The  multiple  sv  stems  moilale  embodies  a  methodology  for  combining  any  number  of  applis.ition  systems  No 
assumptions  are  made  about  the  existence  or  absence  ot  overlapping  data,  ere  l  nderlytng  system  capabilities  are 
represented  declaratively.  arid  a  decision  procedure  which  seeks  to  minimi/e  cost  is  used  to  resolve  cases  in  which 
more  than  one  system  is  capable  of  handling  a  single  subexpression 

2. 1.3.3  A  Next  Discourse  Component 

A  new  discourse  component  was  built  for  IRl  S-li  which  centers  around  two  m.un  notions: 

•  The  use  of  a  general  representation  lor  ''communicative  acts',  ir  events  that  happen  during  the  user- 
machine  interactions.  These  include  inputs  by  die  user,  pointing  actions  by  the  user,  system  responses, 
and  other  system-initiated  communications 

•  The  use  of  discourse  entities  [  M .  ?2|  .is  the  fundamental  unit  for  representing  items  that  are  introduscd 
in  the  discourse  and  which  may  he  referred  to  Discourse  entities  may  be  introduced  by  an  utterance  *a 
"lingut.stjc  communicative  act”  1.  or  by  the  system  eg.  an  object  in  a  table  response  win  answer 
communicative  act  >.  The  role  ot  discourse  entities  in  Janus  is  discussed  in  [3|. 

The  representation  of  discourse  state  in  Janus  now  consists  ol  a  stack  of  communicative  act  structures,  w  ith  the  most 

recent  on  top.  and  a  structure  representing  die  tocus  space. 

Eash  communicative  act  contains  focusing  information  ia  tocus-mlo  structure  1  which  represents  how  the 
communicative  act  affected  the  items  in  locus  We  use  centering  algorithms  [K.  ]  to  track  die  movement  ol  locus 

Each  focus-info  structure  contains  fields  for  the  "back  center",  "forward  centers  .  and  center  movement  status, 
eg.  continuing  or  retaining.  The  focusing  information  is  used  in  resolving  external  anaphora,  where  the  possible 
referents  are  taken  from  the  list  of  forward  centers  A  linguistic  communicative  .ret  1  representing  a  user  s  input  1  in 
addition  contains  fields  lor  the  parse  tree.  WML.  discourse  entities,  and  anaphor  resolution  information  lor  that 
input 
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The  separate  representation  of  a  focus  space  stack  [9]  is  meant  to  contain  focusing  information  regarding  a 
segment  of  a  hierarchical  representation  of  discourse  Since  currently  «e  have  a  flat  representation  of  discourse 
state,  the  focus  space  stack  merely  contains  the  focus- into  for  each  communicative  act 

Treatment  of  intra-sententi.il  anaphora  was  also  added  to  the  discourse  component.  Syntactic  constraints  .m 
represented  in  the  c-command  rules  of  Reinhart  [id,  .ire  stated  declarauvely  as  rules  and  applied  in  determining 
possible  antecedents 

When  more  than  one  possible  referent  is  found  for  an  anaphor.  the  options  are  presented  to  the  user,  who 
indicates  which  to  use. 


2.2  IRLS-II  Capabilities 


2.2.1  Linguistic  Capabilities 

IRL'S-II  handles  a  vers  wide  range  of  English  structures,  including  the  following: 

•  Pronouns.  The  IRL'S-II  system  allows  many  common  uses  of  "it",  that  .  "he”,  etc  to  refer  to  items 
that  have  been  introduced  in  the  discourse,  ic  .  discourse  entities  introduced  by  previous  queries  or 
pointing  aetioas.  It  also  understands  pronouns  referring  to  items  within  the  sentence  untra-sententially  i 
For  example.  "Did  Frederick  report  readiness  as  Cl 

•  Referring  Expressions.  The  IRI  S-II  system  will  allow  common  use  of  phrases  such  as  that  VP 
squadron". '  those  ships  ',  etc.  to  refer  to  items  in  previous  queries. 

•  Ellipsis.  People  do  noi  always  use  complete  sentences  to  express  their  thoughts.  It  1RLS-I1  finds  a  > 

utterance  that  is  not  a  complete  sentence  but  is  a  complete  noun  phrase  or  prepositional  phrase,  it  wil 
attempt  to  understand  it  elliptical!'  b>  looking  back  at  the  previous  query  for  a  corresponding  NP  or  PI 
constituent  that  has  a  similar  semantic  type,  and  substituting  this  new  utterance  for  it.  For  example, 
after  "Show  the  T.AGOS  .  the  input  "Squadrons",  will  he  interpreted  to  me  .in  "Show  the  squadrons" 
However,  tlx-  approach  is  Imnte  I  because  the  substitution  is  ot  full  constituents  only,  which  may  not 
lead  its  the  intended  reading  In  the  following  elliptical  query  sequence  "Which  Cl  earners  are  in 
port’  Subs’  it  is  possible  that  the  user  wants  the  set  of  submarines  to  be  limited  to  those  tha'  areCl. 
but  IRLS-II  would  substitute  the  noun  phrase  "submarines”  for  the  phrase  Cl  earners' .  rather  dun  )U't 
for  the  noun  "earners' ".  and  the  answer  would  include  all  submarines  in  port,  rather  than  just  those  that 
are  Cl  The  correct  decision  in  such  cases  depends  on  semantics  in  subtle  ways  Here  is  an  example 
where  the  modifier  probably  should  not  be  included  in  the  query  built  from  die  ellipsis  Which  Cl 
ships  are  harpoon  capable  ’  "C2  subs  ’  IRLS-II  currently  only  tnes  for  interpretations  by  substituting 

complete  constituents.  A  paraphrase  of  the  elliptical  query  will  show  how  the  ellipsis  was  resolved 

•  Quantification.  Words  like  all'  "any",  "some"  are  handled 

•  Partitives.  Partitions  of  sets  such  as  "3  of  the  <x>"  or  "5  of  the  fastest  <x>  are  handled 

•  Conjunction  Disjunction.  IRLS  covers  cases  of  conjunction  where  the  conjoined  constituents  are 
either  complete  clauses  e  g  "<Display  the  T.A(jf)S>  and  <run  the  SPA  model'--"),  complete  noun 
phrases  te  g  Show  <the  VP  squadrons  -  ami  <the  surface  ships>"i.  or  complete  prepositional  phtases 
te  g  "Which  ships  are  Ml  <on  ASW -»  or  <on  AAW>’  i  It  does  not  in  general  cover  the  more 
complicated  \vnt.ntn  ca-e-  when  the  coii|uncts  depend  on  e.reh  other  by  sharing  elements  te  g.  last 
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and  display  the  Cl  earners  List  the  TACOS  that  were  C2  but  are  now  Cl  ).  There  is  often  more  than 
one  possible  interpretation  of  a  conjunction,  and  people  rely  on  semantic  and  pragmatic  information  in 
understanding  Thus,  most  people  would  assume  that  the  speaker  of  the  following  sentence.  '  List  the 
earners  in  Hawaii  and  San  Diego",  intended  to  get  a  combined  list  of  the  earners  that  either  are  in 
Hawaii  or  in  San  Diego,  as  if  it  were  list  the  carriers  in  Hawaii  and  [list  the  earners]  in  San  Diego  ' 
However,  handling  this  style  of  conjunction  in  a  natural  language  interface  is  a  research  issue,  since  it 
requires  reali/ang  that  and"  functions  as  an  "or".  Postmodi tiers  are  usually  grouped  with  the  closest 
possible  constituent  Thus  "List  the  fn gates  and  earners  in  the  IO".  would  usually  be  interpreted  first  as 
"List  <the  fngates>  and  <the  carriers  m  the  !()>"  listing  all  the  frigates  every  where  A  special  provision 
exists  in  IRIS-11  for  noun  phrases,  however,  so  that  if  the  previous  conjunets  have  not  been 
postmoditied.  a  postmodifier  on  the  final  conjunct  will  be  distnbuted.  yielding  "List  cthe  frigates  |in 
the  IOj>  and  <the  earners  [in  the  IO [>  . 

•  Negation.  For  example.  "Which  subs  am  not  Cl  (Quantification  ami  negation  together  often  causes 
ambiguity ). 

•  Comparisons  and  Superlatives.  IRI  S  II  handles  common  comparative  forms  such  as  "Is  Frederick 
faster  than  Spica’".  as  well  as  simple  superlative  constructs  such  as  "Which  is  the  fastest  carrier  in  the 
Indian  Ocean  ’". 

•  Units  and  Amounts,  e  g..  "3  knots".  "50  mph" 

•  Times  and  Dates.  These  are  understood  in  a  wide  variety  of  formats,  such  as  "  lOOUZ".  "May  31.  D)XX". 
"3 1/5/88”.  and  "2s  io3 1  May  XX”. 

•  Numerical  and  Distance  Calculations,  eg.  "Total  the  number  of  planes  on  all  earners  ",  How  far  is 
Fredenck  from  Hawaii  ?".  'List  the  TAGOs  within  20  miles  of  SPA  2” 

•  Hypothetical  World  queries.  IRl'S-I!  is  being  extended  to  understand  hypothetical*  like  "What  will 
be  the  arrival  time  of  the  unit  if  it  needs  to  refuel  once  ’" 


2.2.1  Handling  \mhiouititcs 

Understanding  English  in  light  of  ambiguity  is  a  central  research  problem  In  this  section,  we  desenhe  the 
various  kinds  of  ambiguity  and  the  cases  handled  in  IRCS-II  There  are  at  least  seven  kinds  of  ambiguities  that  may 
occur  in  natural  language.  These  are: 

1.  Semantic  .Ambiguity  -  for  example,  the  query  "When  will  we  see  California?"  could  refer  to  either  to  the 
nuclear  surface  ship  California  or  California  State 

2.  Structural  Ambiguity  -  In  the  query  "Display  the  frigates  and  earners  that  are  Cl".  "Cl"  may  be  modifying 
either  "earners'"  or  "frigates  and  earners 

3.  Ambiguity  in  Referring  Expressions  -  after  the  query  "Which  TAGOS  are  assigned  to  SPA  1  and  SPA  2  ’ 
the  command  "List  their  locations'  could  refer  either  to  the  locations  of  the  TAGOS "  or  to  the  locations  of  "SPA  1 
and  SPA  2  " 

4.  Conjunction  and  Disjunction  Ambiguity  of  AND/OR  -  Aside  from  the  above  modifier  distribution 
ambiguity,  conjunctions  can  introduce  additional  ambiguity  -  that  of  AND/OR  interpretation.  The  query  "Which 
SPAs  are  alerted  and  within  50  miles  of  Hawaii  ’"  could  refer  either  to  "SPAs  that  are  both  alerted  and  are  within  50 
miles  of  Hawaii  .  or  to  SPAs  that  are  either  alerted  or  within  50  miles  of  Hawaii 
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6.  Quantifier  Scope  Ambiguity  -  Sentences  containing  several  quantified  noun  phrase*,  often  cause 
ambiguities  For  example,  the  sentence  "Did  all  subs  in  this  task  force  report  a  readiness  problem’  ,  could  mean 
either  Did  all  subs  in  this  task  force  report  ;/»«•  same  readiness  problem  or  "Did  each  sub  in  this  task  force  report 
at  least  one  readiness  problem  < but  the  problem  may  differ  from  one  sub  to  another)’ 

7.  Collective , ■'Distributive  Ambiguity  -  For  example,  the  query  "Are  Frederick  and  Spica  communicating’ 
can  be  interpreted  as  "Are  Frederick  and  Spica  communicating  with  each  other’"  or  as  "Is  Frederick  communicating 
<w  ith  something:*  and  is  Spica  communicating  <vvith  something:*"  These  interpretations  arise  because  of  the 
collective  reading  property  that  the  verb  "communicate"  possesses. 

Currently  ,  the  1RCS-II  system  can  completely  identify  and  present  to  the  user  (in  paraphrasing  model  only 
semantic  ambiguity  stemming  from  more  th.in  one  word  sense  Multiple  WMLs  are  generated  in  this  case 

Structural  ambiguity  i-.  detected  by  the  parser.  However,  only  the  first  structural  interpretation  passing 
semantic  tests  is  presented  to  the  user  The  mechanisms  for  requesting  another  parse  exist,  hut  have  not  been 
incorporated  into  the  user  interface  A  call  to  t next- WML  i  by  a  system  developer  obtains  the  next  >y  ntactic  reading 

Referring  expression  ambiguity  is  detected  and  the  options  are  presented  to  the  user 

Ambiguities  regarding  conjunction.'disjunction  and  quantifier  scope  are  not  detected  However  lor  these 
cases  we  may  "envision  additional  interpretations  although  they  have  not  been  actually  produced  by  the  Parser,  and 
propose  those  interpretations  (perhaps  via  a  Paraph  risen  to  the  user  For  example,  in  cases  of  conjunctions  that  are 
premodified  (Type  3).  we  can  "envision"  (or  derive")  the  WML  that  corresponds  to  the  premodifier  distribution, 
from  the  existing  WML  that  corresponds  to  the  interpretation  in  which  the  premodifier  is  attached  only  to  the  first 
NP  land  vice  versa i. 

Collective  distributive  ambiguity  is  currently  handled  by  having  plurals  always  generate  collective  readings 
simplification  of  the  WML  expression  by  the  back-end  includes  performing  distribution  when  appropn.ite  (that  is. 
when  the  predicate  expect*  indiv  iduals.  but  --  because  of  a  plural  --  appears  to  be  apply  mg  to  a  set  i. 


2.2.3  Handling  l.an<iua<ic  Peculiarities  and  Anomalies 

Ill-formedness:  The  IRI  S-11  system  can  handle  certain  types  of  sentences  that  are  not  fully  grammatical 
The  parser  only  Semites  for  an  ill- formed  interpretation  if  it  is  not  able  to  find  a  normal  one  The  classes  of 
iU- formed  input  currently  handled  are 

1  omitted  articles,  e  g  .  "List  units  in  Pacific" 

2  missing  prepositions,  e  g..  "List  units  Pacific  " 

3  subject  verb  disagreement,  e  g  "Which  battle  group  of  earners  .ire  in  the  Pacific  ’" 

4  detenmner  number  disagreement,  e  g  .  "Is  a  units  near  Hawaii  ’  ,  and 

5  imoirect  pornouri  case,  e  g  ."Display  the  unfriendly  units  between  if  and  they  " 
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Spelling  Correction:  A  spelling  corrector  is  invoked  whenever  1RI  S-1I  cannot  make  sense  of  a  word  that  the 
user  his  typed  ti  e.,  the  word  the  user  has  typed  is  not  in  the  dictionary  and  it  does  not  appear  to  be  an  inflected  form 
of  a  word  in  the  dictionary  ).  The  spelling  corrector  proposes  a  possible  list  of  re-spelled  words  front  which  the  user 
can  choose  the  desired  w  ord 

2.2.4  Mapping  to  Multiple  l  'ndcrl\m*>  Systems 

One  of  the  important  features  of  the  IRl'S-FI  system  is  that  it  can  provide  access  to  multiple  under  I  vine 
application  systems  at  a  single  installation.  This  access  is  transparent  to  the  user  and  has  been  made  transparent  to 
much  of  the  natural  language  sy  stem  as  well  This  process  w  as  briefly  discussed  in  section  2. 1 .3.2. 

A  paradigmatic  case  that  demonstrates  accessing  multiple  systems,  is  the  CINCPACFLT  installation  of  the 
IRUS-kh  system  in  which  both  a  Navy  database  ilDBi  and  a  Mapping  system  (OSGPi  are  accessed  Each  has  its 
own  data  tiles,  with  the  tiles  of  LDB  being  the  more  extensive.  The  interface  to  those  systems  was  effectively 
combined  in  an  integrated  natural  language  interface  that  was  able  to  transfer  data  found  only  in  IDB  to  make 
displays  on  OSGP 

2.2.5  Mixed  Modality  Interaction 

The  natural  language  mode  of  input  can  be  integrated  w  ith  other  kinds  of  input  such  as  graphics.  Speciticalh . 
when  demonstrated  as  pan  of  Janus,  an  integrated  system  that  included  a  map  with  mousable  icons.  IRl'S-II 
provided  a  means  for  mouse  clicks  on  map  icons  to  be  remembered  as  entities  which  could  then  be  referred  to  bv 
deictic  expressions  in  natural  language  input  For  example,  one  could  click  on  three  different  .submarines  and  then 
type  in.  "Assign  those  to  SPA  2  This  kind  of  mixed-mode  interaction  is  an  important  pan  of  the  seamless 
character  of  an  interface,  allowing  die  user  to  move  fluidly  between  modalities,  so  that  the  graphical  display  reflects 
the  result  of  understanding  and  responding  to  the  natural  language  query .  and  die  query  in  turn  can  refer  to  graphical 
entities 
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3.  Building  an  IRUS-II  Domain  Model 

3.1  Introduction 


The  domain  model  provides  a  link  between  the  user’s  view  of  the  domain  and  the  underlying  system's 
representation  of  that  domain.  Once  the  system  has  interpreted  the  user's  query  in  terms  of  a  logical  representation 
that  uses  entities  from  the  domain  model,  translation  rules  are  used  to  convert  that  logical  representation  into  the 
query  language  of  the  underlying  system.  This  approach  effectively  isolates  the  user  from  the  technical  details  of  the 
underlying  system  and  the  formal  query  language  of  the  underlying  structure. 

Since  the  first  implementation  of  IRUS-11  was  designed  to  support  database  queries,  the  remainder  of  this 
chapter  will  assume  a  database  query  application.  Regardless  of  the  functionality  of  the  underlying  system,  the 
requirements  for  developing  a  domain  model  are  similar.  Additionally,  applications  other  than  database  quenes  may 
require  the  implementation  of  new  operators  that  were  not  in  the  first  implementation  of  IRUS-11.  such  as  "WHAT- 
IF'  for  hypothetical  quenes  posed  to  expert  systems. 

A  useful  distinction  that  can  be  drawn  in  the  analysis  of  a  natural  language  input  utterance  is  the  distinction 
between  the  denotative  aspect  of  the  utterance  and  its  performative  aspect.  The  denotative  aspect  refers  to  the 
objects  in  the  domain  that  are  referred  to  by  the  utterance.  The  performative  aspect  refers  to  the  action  that  the 
system  is  being  instructed  to  perform  in  regard  to  those  objects.  For  example,  consider  the  utterance: 

Display  the  locations  of  the  Cl  units. 

Displav  represents  the  performative  aspect  of  this  utterance.  The  interpretation  of  Display  depends  on  the 
functionality  of  the  underlying  system.  The  remainder  of  the  sentence  (its  denotative  aspect),  on  the  other  hand,  can 
be  intepreted  independently  of  the  functionality  of  the  system.  In  one  implementation.  Display  may  result  in  a 
tabular  listing:  in  another  implementation  it  may  result  in  a  graphic  presentation  of  a  map.  But  the  logical 
representation  of  the  denotative  aspect  of  the  utterance  is  independent  of  system  functionality. 

While  additions  to  the  performative  support  provided  by  ERUS-II  currently  require  the  expertise  of  the 
IRUS-II  developers,  tools  are  available  to  make  the  addition  of  new  denotative  support  (i.e..  a  new  domain  model)  a 
much  simpler  matter. 
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3.2  Domain  Models 


A  domain  model  is  a  conceptual  description  of  a  subset  of  the  entities  in  the  world.  The  relevant  subset  is 
determined  by  the  application  of  the  system  For  example,  a  system  that  will  be  used  to  access  a  database  conrurung 
information  on  a  Navy  unit’s  equipment  inventory  will  contain  references  to  items  of  Navy  equipment  —  weapons, 
ships,  ports,  helicopters,  etc.,  and  to  relevant  data  about  that  equipment  —  location,  maintenance  status,  model, 
identification  number,  cost,  etc. 

The  entities  in  a  domain  model  are  concepts  and  roles.  Each  concept  represents  a  class  of  things,  and  typically 
is  named  for  this  class  of  things  Roles  represent  relationships  in  which  those  concepts  participate.  For  example,  the 
concept  SHIP  represents  the  class  of  things  that  are  ships.  Associated  with  it  may  be  several  roles  such  as  SHIP- 
NAME,  ID-NUMBER.  COMMANDER,  HOME- PORT.  For  each  of  these  roles,  the  class  of  things  that  are  ships  are 
the  domain.  Each  of  these  roles  also  has  a  range  —  the  class  of  things  into  which  they  map.  For  example,  the  range 
of  SHIP-NAME  could  be  the  class  STRING,  the  range  of  ID-NUMBER  could  be  the  class  NUMBER,  the  range  of 
COMMANDER  could  be  the  class  PERSON  (or  OFFICER),  and  the  range  of  HOME-PORT  could  be  the  class 
PORT. 

Once  the  domain  developer  has  identified  a  concept  that  should  be  included  in  the  domain  model,  then  a 
number  of  roles  for  which  that  concept  is  the  domain  will  immediately  come  to  mind.  In  creating  those  roles,  the 
domain  developer  will  have  to  specify  the  classes  w  hich  are  their  ranges.  These  ranges  represent  additional  concepts 
which  will  have  to  be  included  in  the  domain  model.  These  concepts,  in  turn,  will  be  the  domains  of  new  roles.  Once 
begun,  the  process  of  developing  a  domain  model  tends  to  proceed  spontaneously.  Dunng  implementation,  testing, 
and  debugging,  the  decisions  made  in  the  initial  design  stage  will  probably  have  to  be  altered  and  adjusted. 

The  domain  developer  can  approach  the  problem  of  identifying  these  roles  and  concepts  from  two  directions 
—  from  the  actual  structure  of  the  target  database  or  from  the  questions  that  the  targeted  users  would  have  for  such  a 
database.  In  fact,  both  directions  are  necessary  to  obtain  a  complete  domain  model  for  use  in  IRUS-I1  An  effective 
approach  is  to  interv  iew  the  targeted  users  and  elicit  samples  of  the  actual  questions  they  would  submit  to  such  a 
database,  making  sure  each  possible  database  table  or  field  is  covered  by  some  question.  This  will  define  the  range 
of  concepts  and  relations  people  use  in  the  domain. 

It  should  be  noted  that  the  domain  model  is  not  required  to  be  limited  to  contain  entities  that  map  to  the 
database,  but  other  related  entities  that  a  user  is  likely  to  use  should  be  there  too  in  order  to  respond  sensibly  when 
there  are  misconceptions  by  the  user  For  example,  a  target  user  of  the  database  mentioned  above  might  wish  to  ask 
a  question  such  as 

In  which  factory  was  this  helicopter  made? 

IRUS-II  should  understand  the  question  (be  able  to  obtain  a  WAIL  using  domain  predicates),  even  though  it 
cannot  answer  it  In  JANT'S-B.  a  troubleshooting  capability  is  under  development  to  be  able  to  answer  I  don’t  have 
inlormanon  about  where  helicopters  are  made 
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If  the  domain  developer  is  developing  a  domain  model  that  is  intended  to  be  transportable  to  a  number  of 
different  databases  dedicated  to  the  same  domain,  that  domain  model  should  contain  all  of  the  concepts  that  would 
possibly  be  reflected  in  those  databases.  IRUS-II  will  not  support  access  to  any  data  in  a  databxse  unless  that  data  is 
conceptually  represented  in  the  domain. 

As  an  example  of  a  domain,  let  us  assume  that  our  applicauon  is  a  hospital  database.  The  targeted  users  arc 
doctors,  nurses,  and  other  hospital  staff.  The  hospital  database  will  contain  information  about  patients,  doctors, 
nurses,  floors,  wards,  rooms,  etc.  These  will  be  the  concepts  in  the  domain  model.  One  of  the  pnnciples  behind 
taxonomic  networks  is  that  it  is  useful  to  create  concepts  that  generalize  other  concepts.  This  reduces  the  amount  of 
information  that  has  to  be  stored  by  taking  advantage  of  inheritance.  It  also  generates  concepts  that  reflect  semantic 
distinctions. 

For  example,  it  is  likely  that  we  would  want  to  be  able  to  query  the  database  about  a  doctor's  social  security 
number.  We  would  also  like  to  be  able  to  refer  to  the  social  secunty  numbers  of  nurses  and  other  staff  members. 
Since  it  is  likely  that  patient  social  security  numbers  are  used  for  identification,  we  would  also  like  to  be  able  to  refer 
to  them.  A  sensible  approach  would  be  to  create  a  concept  called  HOSPITAL-PERSON  that  is  a  superconcept  of 
DOCTOR.  NURSE.  OTHER-STAFF  and  PATIENT.  The  Role  SOCIAL-SECURITY-NUMBER  could  then  be 
associated  with  the  concept  HOSPITAL-PERSON  aid  tnhented  by  all  concepts  subsumed  by  HOSPITAL- 
PERSON. 

3.3  Building  Domain  Models  with  MkL 


Currently,  IRUS-II  domain  models  are  built  with  MKL  [  16].  MKL  (New  Implementation  of  KL-ONE)  is  a 
system  for  representing  conceptual  knowledge  With  MKL.  the  user  can  represent  know-ledge  tn  a  structured 
inheritance  network.  In  addition  to  enabling  the  user  to  describe  a  domain  with  constants  that  can  be  used  in  other 
modules  of  the  IRUS-U  system,  MKL  also  provides  inference  mechanisms  that  can  be  of  use.  such  as  inhentance 
and  classification.  A  graphical  interlace.  KREME[L2|.  is  used  to  more  easily  visualize  and  modify  a  NIKL 
network 

In  its  current  implementation.  IRUS-II  uses  the  inheritance  mechanism.  For  example,  the  Semantic 
Interpretation  Rules  (IRulesi  test  the  semantic  class  of  constituents  to  determine  if  they  are  subsumed  by  cenam 
MKL  concepts.  These  tests  can  name  the  highest  level  concept  for  which  they  are  appropriate,  w  ith  confidence  that 
the  inhentance  mechanism  will  correctly  determine  whether  the  individual  being  tested  is  a  member  of  that  class  or 
any  class  subordinate  to  it  in  the  network  (subsumed  by  it  t 

Although  it  is  not  done  in  the  current  implementation.  IRUS-II  can  also  make  use  of  the  classification 
mechanism  of  MKL.  Classification  enables  NIKL  to  correctly  place  a  new  concept  in  a  network  from  the  definition 
of  that  concept.  This  makes  it  possible  to  take  information  in  a  query  and  use  it  to  create  a  MKL  concept.  This  can 
he  used  to  simplify  an  express'-m  by  reducing  a  complex  expression  to  a  NIKL  concept  For  further  details,  see  [  2b|. 
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.3.3.1  MKL  Concepts 

The  objects  in  a  NIKL  domain  are  known  as  concepts.  These  concepts  are  organized  in  a  network  that  is 
rooted  in  the  concept  THING,  which  subsumes  all  other  concepts  in  the  world.  Concept  A  subsumes  consent  R  if  all 
instances  of  B  (i.e.,  individuals  of  the  world  that  are  of  class  B  )  are  instances  of  A.  For  example,  all  animals  are 
living  things.  Therefore,  if  one  were  to  build  a  network  that  included  the  concept  ANIMAL,  the  concept  HORSE, 
and  the  concept  LIVING-THLNG.  the  concept  L1YING-THING  would  subsume  ANIMAL  and  HORSE  The 
concept  ANIMAL  would  subsume  the  concept  HORSE.  Subsumption  is  transitive,  i.e..  if  LIVING-THING 
subsumes  ANIMAL,  and  ANIMAL  subsumes  HORSE,  then  LIVING-THING  also  subsumes  HORSE.  The 
relationship  between  a  concept  and  the  concepts  that  subsume  it  corresponds  to  an  IS-A  link  in  frame  terminology. 

Concepts  can  be  either  primitive  or  defined.  Typically,  domain  development  begins  with  the  creation  of  some 
primitive  concepts  and  then  proceeds  to  the  creation  of  defined  concepts  based  on  the  existing  pnnnnve  concepts.  A 
primitive  concept  is  one  which  the  domain  developer  can  not  fully  define  in  terms  of  existing  concepts  in  the 
network. 

A  defined  concept  is  a  concept  for  which  the  necessary  and  sufficient  criteria  for  membership  in  the  class  it 
represents  can  be  expressed,  by  specify  ing  the  parents  of  the  concept  and  appropnate  role  restrictions  (the  following 
section  discusses  role  restrictions).  For  example,  the  necessary  and  sufficent  critena  for  membership  in  the  class 
RED-SNEAKER  is  that  an  object  be  a  SNEAKER  and  that  it  have  the  color  RED.  i.e..  RED-SNEAKER  is 
subsumed  by  SNEAKER  and  restncts  the  role  COLOR-GF  to  have  range  RED.  In  many  cases,  the  necessary  and 
sufficient  criteria  for  membership  in  the  class  represented  by  a  concept  can  not  be  expressed.  For  example,  consider 
the  class  TIGER.  A  TIGER  is  a  subconcept  of  ANIMAL  and  we  can  express  some  of  the  necessary  cntena  for 
membership  in  the  class  represented  by  TIGER,  but  we  can  not  define  the  necessary  and  sufficient  cntena  to 
distinguish  a  TIGER  from  all  other  members  of  the  class  ANIMAL.  TIGER  is  thus  a  primitive  concept,  a  concept 
that  we  can  describe  but  not  completely  define 

3.3.2  MKL  Roles  and  Role  Restrictions 

Roles  are  NIKL  entities  which  represent  logical  relationships  between  concepts.  A  Role  would  be  referred  to 
as  a  slot  in  frame  terminology.  A  Role  is  a  two-place  relation  whose  domain  is  represented  by  a  concept  and  whose 
range  is  represented  by  a  concept.  A  Role  maps  each  instance  of  the  domain  concept  into  instances  of  the  range 
concept.  An  instance  of  the  range  concept  is  a  filler  of  the  role  for  the  instance  of  the  domain  concept. 

Part  of  the  descnption  of  a  concept  may  include  role  restrictions.  Role  restnctions  restrict  the  allow  ed  class  of 
the  range  and/or  the  number  of  range  fillers  for  that  role,  when  the  domain  is  that  concept.  Subconcepts  inherit  both 
the  Roles  and  RoleRestnctions  of  their  superconcept v  Inherited  RoieRestnctions  can  be  further  restricted,  but  not 
generalized,  for  the  subconcept.  In  tact,  this  is  one  way  of  defining  a  subconcept.  For  example,  the  concept 
PERSON  may  have  the  Role  ADDRESS  attached  to  it  The  range  of  ADDRESS  may  be  restricted  to  addresses  The 
concept  I  SA-RESIDENT  may  be  cmphtc's  defined  as  the  subconccpt  of  PERSON  for  which  the  range  of 
ADDRESS  is  revtrivted  to  addresses  in  the  I  SA 
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The  .ii  MBER  component  of  a  role  restriction  is  particularly  important  when  defining  a  functional  role.  i.e..  a 
role  that  relates  each  element  of  the  domain  to  one  and  only  one  instance  of  the  range.  HULL- NUMBER- OF  is  one 
such  role  relating  VESSEL  to  HULL-NUMBER.  To  define  a  functional  role,  one  must  have  a  role  restricuon  with 
the  NLLMBER  component  specified  as  1  at  the  most  general  concept  that  is  the  domain  of  the  role.  IRUS-1I  does  not 
use  this  information  currently,  but  will  in  the  future  when  a  more  general  treatment  of  functional  nouns  is 
implemented 

When  one  is  developing  a  NIKL  network,  one  goal  is  to  give  roles  the  most  general  domain  and  range  that  are 
relevant,  and  when  a  role  restriction  is  needed,  to  connect  it  to  the  most  general  concept  possible 


3.4  NIKI,  Syntax 


The  two  primary  functions  used  in  creating  a  domain  mode!  with  NTKL  are  defconcept  and  defrole.  As  the 
names  indicate,  defconcept  is  used  to  create  a  concept  and  defrole  is  used  to  create  a  role.  The  following  uses 
standard  BNF  syntax,  w  here  {  )  denotes  optional  items,  and  *  denotes  one  or  more  repetitions  of  an  expression. 


3.4.1  DF.F('()NCF.PT 


The  format  for  defconcept  is: 

(DEFCONCEPT  < concept -name > 

(SPECIALIZES  <concept>*} 

(PRIMITIVE) 

(INDIVIDUAL) 

{ (RES  <role>  { (VRCONCEPT  <concept>) » 

( (NUMBER  <integer>) } 

( (MIN  <integer>) )  { (MAX  <integer>) )) )* 

{(DATA  (LEX I CAL  ITEMS  (<lexical-item>)  )  )  } 


) 

PRIMITIVE 

means  that  the  necessary  and  sufficient  critena  for  membership  in  this  class  are  not  expressed  in  the  domain  model 
If  not  present,  assumed  to  be  a  DEFINED  concept. 

INDIVIDUAL 

means  that  the  concept  being  created  represents  an  individual  enuty  which  is  a  member  of  the  class  or  classes 
denoted  by  this  concept's  parents  For  example,  the  current  domain  model  includes  the  following: 

(DEFCONCEPT  TRUE  PRIMITIVE  INDIVIDUAL  (SPECIALIZES  TRUTH . VALUE ) ) 


TRUE  is  a  an  individual  ot  class  TRITH  VALUE  (the  only  oi.ier  member  of  the  TRUTH. VALUE  class  is  the 
individual  FALSE' 
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SPECIALIZES 

specifies  the  concepts  that  immediately  subsume  the  concept  being  created. 

RES 

expresses  restrictions  on  a  role  (which  is  named  immediately  after  the  keyword),  when  the  domain  of  the  role  is  an 
installation  of  this  concept  These  restrictions  are  specified  with  VRCONCEPT.  NUMBER.  MIN,  and  MAX. 

VRCONCEPT 

specifies  a  class  restriction  on  the  filler  of  the  range  of  role  The  filler  of  the  range  is  limited  to  members  of  the 
named  class. 


NUMBER 

specifies  the  exact  number  of  individuals  that  may  till  that  role  for  each  member  of  the  class.  For  example,  the 
current  domain  model  includes  the  following: 

(DEFCONCEPT  VESSEL 

PRIMITIVE 

(SPECIALIZES  PLATFORM) 

(RES  COMMANDER . OF  (VRCONCEPT  COMMANDER)  (NUMBER  1))) 

The  last  line  indicates  that  the  role  COMMANDER. OF  associated  with  the  concept  VESSEL  may  be  filled  by 
exactly  one  member  of  the  class  COMMANDER 

MIN 

Similar  to  NUMBER,  but  specifies  a  minimum  number  of  role  fillers  for  each  member  of  the  class. 

MAX 

Similar  to  NUMBER,  but  specifies  a  maximum  number  of  role  fillers  for  each  member  of  the  clas. 


DATA 

is  rarely  used.  In  conjunction  with  LEXICALITEMS  it  specifies  an  English  won!  associated  with  the  concept 
Interpretation  Rules  (discussed  in  Chapter  4i  are  connected  to  domain  objects  via  this  data  field,  but  tins  is  not 
specified  m  the  domain  model  tile,  they  are  added  by  the  IRule  loader  program. 


3.4.2  DFKROLK 


The  format  fordefrole  is: 

(DEFROLE  <role-name> 

(DIFFERENTIATES  <role-name>*) 
(DOMAIN  <concept>) 

(RANGE  <concept>) 

(PRIMITIVE } 
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DIFFERENTIATES 

specifies  a  parent  role  which  the  role  being  created  is  a  subrole  of  (a  more  specific  relation),  that  is.  if  the  parent  role 
is  viewed  as  denoting  a  set  of  ordered  parrs,  this  role's  denotation  is  a  subset  of  that  set.  The  role  being  created  must 
have  as  domain  and  range  concepts  that  are  subconcepts  of  the  parents'  respective  domains  and  ranges 

DOMAIN 

specifies  the  domain  of  the  role. 


RANGE 

specifies  the  range  of  the  role  The  role  represents  a  relation  between  elements  of  the  domain  and  elements  of  the 
range. 


PRIMITIVE 

A  primitive  role  is  one  for  which  necessary  and  sufficient  critena  can  not  be  expressed  If  not  present,  the  role  is 
assumed  to  be  a  DEFINED  role. 


3.5  KRKMK 


BBN’s  Knowledge  Representation.  Editing,  and  Modeling  Environment  (KREME)  is  an  excellent  tool  for  the 
development  of  a  domain  model.  KREME  has  its  own  frame  language  which  is  closely  related  to  N1KL.  but 
KREME  also  has  a  NIKL  mode  which  enables  it  to  work  with  domain  models  built  with  NIKL  Whether  the  domain 
model  is  created  with  NIKL  directly  or  via  KREME  in  the  NIKL  mode,  the  result  is  still  a  NIKL  taxonomy  The 
only  difference  is  that  KREME  provides  a  better  environment  for  domain  model  development  For  further 
information  on  KREME.  consult  A/?£V/F  4  User's  Introduction.  BBN  Report  No.  6508. 


3.6  A  Brief  Domain  Model  fXampie 


This  section  is  intended  to  provide  some  insight  into  the  process  involved  in  developing  a  domain  mode!  for  a 
specific  application  Let  us  assume  that  we  wish  to  construct  a  domain  model  for  a  naval  application  In  this 
application  a  nmr.bc'  of  different  tvpes  of  objects  will  be  referred  to  Among  them  are 

•  helicopter 

•  airplane 

•  submarine 

•  earner 

•  ocean 
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•  port 

•  weapon 

Each  of  these  objects  will  be  represented  as  concepts.  If  we  did  nothing  more  than  that,  we  could  create  a 
network  whose  top  level  was  THING  and  whose  next  level  would  represent  each  of  these  objects  as  concepts 
However,  we  would  not  have  succeeded  in  captunng  any  generalizations  about  these  concepts  and  we  would  not 
have  established  any  inheritance  (beyond  the  fact  that  they  all  descended  from  THING)  which  we  could  later  use  in 
our  IRules.  Therefore,  we  would  wish  to  create  some  additional  concepts  which  expressed  generalizations  about 
these  objects. 

One  approach  would  be  to  sort  them  into  groups  and  then  to  create  concepts  that  represent  those  groups  For 
example,  one  might  create  a  group  from  helicopter,  airplane,  submarine,  and  earner,  and  call  that  concept 
VEHICLE  Intuitively,  one  feels  that  die  distinction  between  air  vehicles  and  sea  vehicles  might  he  a  useful  one  and 
so  might  create  the  concepts  AIRCRAFT  and  VESSEL.  These  would  each  be  subsumed  by  VEHICLE.  In  turn. 
AIRCRAFT  would  subsume  the  concepts  HELICOPTER  and  AIRPLANE,  and  VESSEL  would  subsume  tin- 
concepts  SUBMARINE  anu  CARRIER 

The  concepts  OCEAN  and  PORT  could  be  subsumed  under  a  concept  we  will  call  LOCATION 

The  concepts  READINESS  and  WEAPON  can  be  subsumed  by  THING. 

These  concepts  will  suggest  roles  that  arc  needed  to  express  their  attnbutes  These  roles  in  turn  will  require 
the  creation  of  additional  concepts  to  represent  their  ranges  For  example,  we  may  w  ish  to  describe  the  location  of  a 
vehicle  We  could  create  the  role  VEHICLE. LOCATION  with  the  domain  VEHICLE  and  the  range  LOCATION 
We  might  also  w  ish  to  refer  to  the  commander  of  a  VESSEL  We  would  create  the  role  COMMANDER. OF  and 
have  its  range  as  OFFICER  It  is  likely  that  before  we  finish  we  will  want  to  represent  other  people,  so  we  might 
subsume  OFFICER  under  the  concept  PERSON  All  of  these  domain  model  decisions  should  be  considered 
tentative  until  we  have  explored  the  domain  further 

One  way  to  futher  explore  the  domain  is  to  get  a  list  of  sentences  which  would  tvpicallv  be  input  by  the  user 
Let  us  assume  that  the  following  sentences  appear  in  this  list: 

What  is  tlx  overall  readiness  of  the  Enterprise 

What  is  the  Enterprise's  home  port ' 

(Enterpnse  is  the  name  of  a  earner; 

These  suggest  some  additional  concepts  and  roles  The  existing  concept  VESSEL  will  be  the  domain  of  the 
role  OVERALL  READINESS  This  will  have  a  new  conecpt.  perhaps  READINESS  CODE,  as  its  range  The  role 
HOME  PORT  might  N,-  cre.ued  wnf,  the  concept  VESSEL  as  ns  domain,  and  PORT  as  ns  range 
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Again,  these  domain  mouel  decisions  should  be  regarded  as  preliminary  until  it  is  established  that  the 
underlying  system  contains  the  information  necessary  to  respond  to  such  requests. 

The  domain  developer  would  proceed  to  deselop  a  domain  model,  using  anticipated  user  input  and  the 
information  contained  in  the  underlying  system  as  guides.  The  concepts  and  roles  in  this  preliminary  model  and  the 
anticipated  input  sentences  would  then  be  used  to  develop  the  IRules  and  the  dictionary  entnes.  This  process  would 
provide  feedback  on  the  appropnateness  of  the  domain  model.  The  domain  developer  should  be  prepared  to  make 
changes  to  the  domain  model  in  response  to  this  feedback. 
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4.  Semantic  Interpretation  Rules 


A  Semantic  Interpretation  Rule  (IRule)  consists  of  two  sections,  a  pattern  section  and  an  action  section.  The 
pattern  section  is  a  case  frame.  Constituents  of  a  query  may  fill  slots  in  a  case  frame  if  they  meet  the  selection 
restrictions  attached  to  that  slot.  The  action  secuon  contributes  to  the  logical  representation  of  the  query  .  The 
logical  representation  of  the  query  is  in  the  World  Model  Language  (W'ML).  described  in  chapter  6. 

The  Semantic  Interpretation  Rules  tIRules)  are  linked  to  particular  domain  model  concepts  or  roles  when  they 
are  defined  by  the  domain  developer.  They  are  then  activated  by  words  in  the  input  query.  whose  dictionary  entries 
link  them  to  corresponding  domain  concepts.  The  IRuIes  define,  for  a  particular  word  or  class  of  words,  the 
semantically  acceptable  English  phrases  or  clauses  that  can  occur  having  that  word  as  their  head.  They  also  define 
the  semantic  interpretation  of  an  accepted  phrase  or  clause.  Since  semantic  processing  is  integrated  with  syntactic 
processing  in  IRUS,  the  IRuIes  serve  to  block  a  semantically  unlikely  interpretation  as  soon  as  it  is  proposed  by  the 
parser. 

The  parser  passes  ("transmits”)  constituents  to  the  semantic  interpreter.  The  semantic  interpreter  attempts  to 
match  each  constituent  to  the  pattern  sections  of  the  IRuIes  associated  w  ith  the  head  word  of  the  current  phrase  If 
the  pattern  section  of  an  IRule  is  successfully  matched,  that  IRule's  action  sec'ion  is  queued  for  execution.  The 
action  sections  produce  the  fragments  of  W'ML  that  are  ultimately  composed  into  the  system's  logical  representation 
of  the  query.  The  action  sections’  secondary  function  is  to  place  additional  information  on  a  list  of  features  that 
accompany  the  partial  interpretations  and  which  other  IRuIes  may  access,  such  as  the  semantic  class  of  the 
constituent.  Failure  to  match  also  provides  useful  information  to  the  parser.  If  a  particular  parse  route  fails  to  find  a 
corresponding  semantic  interpretation,  then  the  parser  tries  another  parse  route. 

When  a  head  word  of  a  phrase  is  proposed  by  the  parser  to  the  semantic  interpreter,  all  IRuIes  that  can  apply 
to  the  head  word  for  the  given  phrase  type  are  gathered  as  follows:  for  each  domain  model  object  (concept  or  role) 
that  is  associated  with  the  word,  the  IRuIes  associated  w  ith  the  given  object  are  retrieved,  along  with  any  inhented 
IRuIes.  A  word  can  also  have  IRuIes  fired  directly  by  it.  without  involving  the  model.  Since  the  IRuIes 
corresponding  to  the  different  word  senses  may  give  rise  to  separate  interpretations,  they  are  carried  along  in  parallel 
as  the  processing  continues.  If  no  IRuIes  are  retrieved,  the  interpreter  rejects  the  word. 

One  use  of  the  domain  model  is  that  of  IRule  inheritance.  When  an  IRule  is  defined,  it  is  attached  to  a 
concept  in  the  domain  model  The  definer  of  the  IRule  also  has  the  opportunity  to  decide  whether  the  new  IRule  (the 
ha.se  IRule )  should  inherit  from  IRuIes  attached  to  higher  domain  model  concepts  (the  inherited  IRuIes).  or  possibly 
inherit  from  other  IRuIes  specified  by  the  definer  of  the  (Rule  When  a  modifier  of  a  head  w  ord  gets  transmitted  and 
no  pattern  for  it  exists  in  a  base  IRule  for  the  head  word,  higher  [Rules  are  searched  for  the  pattern  If  a  pattern  does 
exist  for  the  modifier  in  a  given  IRule.  no  higher  ones  are  tried  even  if  it  does  not  pass  the  local  semantic  test  That 
is.  inheritance  does  not  relax  semantic  constraints. 
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Generally  speaking.  IRule.s  are  attached  to  concepts  or  roles  in  the  domain  model  and  they  are  indirectly 
linked  to  words  in  the  lexicon  by  the  inclusion  of  the  names  of  domain  model  objects  in  the  semantics  of  each 
dictionary  entry  When  no  domain  object  seems  appropnate  as  a  semantic  entry  for  a  word,  IRules  may  be  linked  to 
the  word  directly  by  having  the  name  of  the  IRule  included  in  the  semantics  of  the  word. 


4.1  Pattern  Section 


Once  the  head  of  a  phrase  triggers  an  IRule,  all  of  its  modifiers  must  match  some  slot  in  the  pattern  section  of 
the  IRule.  If  the  matches  succeed,  the  action  section  of  that  IRule  is  queued  for  execution  at  the  time  the  constituent 
is  completed  ("constituent  pop  time"). 

There  are  two  types  of  IRules,  CLAUSE  IRules  and  NP  IRules.  CLAUSE  IRules  are  further  divided  into 
VERB,  PP.  and  ADJ  IRules.  (It  is  planned  that  PP  and  ADJ  IRules  will  in  the  future  be  separate  calegones 
altogether,  so  that  there  will  be  4  general  categones  of  IRules.)  Most  simple  cases  of  NP,  PP.  and  ADI  IRules  may 
now  be  defined  using  KNACQ  (chapter  7 ). 

4.1,1  Clause  IRules 

The  following  is  a  template  for  the  pattern  section  of  a  VERB  clause  IRule.  Its  head  is  a  verb,  and  the  IRule 
defines  the  case  frame  for  that  verb.  Standard  BNF  syntax  is  used 
(CLAUSE 


HEAD 

* 

(SUBJECT 

<test> } 

(OBJECT 

<test> } 

{ INDOBJ 

<test> } 

(PP 

( (PP  HEAD 

<P repos it ion-test > 

POBJ 

<test>)  +)  } 

( THATCOMP 

<test> ) 

{ <ot he r -verb -modi f ier-label>  <test>) * 

) 

CLAUSE  means  that  this  IRule  is  designed  to  handle  clause-level  constituents,  either  sentences  or  subordinate 
te  g  relative)  clauses  NP  IRules  would  handle  the  NP  subconstituents  of  the  sentence  li  is  helpful  to  think  of  the 
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clause  IRules  as  the  higher  level  IRules  that  bind  together  subinterpretations  produced  by  the  NP  IRules.  Following 
’’CLAUSE''  is  a  property  list,  of  <parser-label>  and  <test>  pairs,  each  pair  representing  a  selectional  restriction. 
Every  structure  "transmitted'  by  the  parser  to  the  interpreter  has  a  parser-label:  the  test  places  semanuc  constraints 
on  the  structure  transmitted  with  the  given  label. 

HEAD  is  a  slot  that  must  be  filled  by  the  head  of  the  constituent.  The  HEAD  of  a  (non  predicate-adjective) 
sentence  is  its  verb,  the  head  of  an  NP  is  a  noun,  the  head  of  a  sentence  with  a  predicate  adjecuve  ("<NP>  <be> 
<adjective>”)  is  the  adjective  (e  g.,  the  parser  considers  "pretty"  as  the  head  in  "the  girl  is  pretty"),  and  the  head  of  a 
sentence  with  a  predicate  PP  is  the  copular  "BE"  (e.g..  "the  ship  IS  in  Hawaii").  Since  the  semantic  properties 
attached  to  a  specific  verb  trigger  clause  IRules  during  the  interpretation  of  a  query,  there  are  typically  no  additional 
restrictions  on  the  HEAD.  This  is  represented  by  the  asterisk  associated  with  the  HEAD  slot. 

The  SUBJECT  of  a  clause  is  usually  a  noun  phrase.  In  order  to  be  accepted  as  a  filler  for  the  SUBJECT  slot,  a 
noun  phrase  must  meet  the  test  associated  with  that  slot.  Section  4. 1.5  presents  the  syntax  of  these  tests. 

The  OBJECT  of  a  clause  is  the  direct  object  of  the  (transitive)  verb. 

The  IWDOBJ  is  the  indirect  object.  For  example,  the  verb  s/icm  may  take  an  indirect  object: 

Show  me  the  ships  in  the  Indian  Ocean. 

The  direct  object  of  show  is  ships  and  the  indirect  object  is  me. 

Verbs  may  be  modified  by  prepositional  phrases.  The  PP  slot  of  a  clause  IRule  would  be  filled  by  such 
prepositional  phrases.  This  slot  may  not  be  tilled  by  a  prepositional  phrase  that  modifies  a  noun.  These  are  handled 
in  NP  IRules  or  PP  (predicate  prepositional  phrase)  IRules.  The  HEAD  of  a  prepositional  phrase  is  the  preposition 
and  the  POB1  is  the  object  of  the  preposition.  There  are  two  tests  on  prepositional  phrases;  one  on  the  head 
preposition  and  another  on  the  object  of  the  preposition. 

The  PP  slot  may  be  repeated  because  more  than  one  PP  may  modify  a  clause.  Repeated  slots  are  implicitly 
assigned  numbers  that  are  later  used  in  the  action  secuon.  For  example,  the  current  IRule 
CH.\JSGE.IN  READINESS  2  (tnggered  by  the  verb  change)  has  these  PP  slots  associated  with  it: 


(PP 

HEAD 

FROM 

POBJ 

(IS-A 

C . READINESS . CODE) ) 

(PP 

HEAD 

TO 

POBJ 

(IS-A 

C. READINESS. CODE) ) 

In  the  action  section,  these  are  distinguished  from  each  other  by  references  to  PP  1  and  PP  2.  Different  WML 
representations  are  built  for  each  case. 

THATCOSfP  tests  for  a  THAT  complement,  as  in  "Frederick  reported  that  Vinson  downgraded 

There  may  be  selectional  restnctions  for  any  other  clause  modifier  such  as  TOCOMP  (TO  complements)  that 
the  parser  may  transmit 
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4.1.2  NP  {Rules 


The  format  for  the  pattern  section  of  an  NP  [Rule  looks  like  this: 


(HP 


HEAD 

{HP 

(PP 

{H 

{ADJ 

(PRESPART 
{ PASTPART 
{PREHOMIHAL 
) 


(<test>+)  } 

( (PP  HEAD  <Preposition-test> 
POBJ  <test>)  +)  } 

(<test>+)  } 

(<test>+)  } 

(<test>+)  ) 

(<test>+)  } 

(<test>+)  } 


NP  means  the  IRule  is  designed  to  handle  parts  of  noun  phrases.  A  noun  phrase  includes  a  noun  plus  all  its 
modifiers  such  as  articles,  adjectives,  noun  modifiers,  and  prepositional  phrases. 


The  HEAD  of  a  noun  phrase  is  the  noun  itself.  Since  the  noun  would  typically  have  the  appropnate  semantic 
property  to  trigger  the  IRule,  usually  there  are  no  restrictions  on  the  HEAD.  This  is  expressed  with  the  asterisk.  It  is 
possible,  however,  to  put  a  test  on  the  head  of  an  NP  IRule. 


NP  is  for  noun  phrase  modifiers  to  the  head  noun.  NP,  like  the  other  selectional  restrictions  presented  so  far 
(except  for  HEAD),  tests  full  (interpreted)  constituents.  Interpreted  constituents  have  been  interpreted  as  phrases 
and  have  already  tnggered  IRules  of  their  own.  Interpreted  constituents  already  have  a  partial  WML  and  semantic 
class  associated  with  them.  For  example,  an  NP  selectional  restriction  could  accept  an  NP  constituent  for  USA'' 
modifying  "ships"  in  the  noun  phrase  'USA  ships",  as  well  as  accepting  "friendly  country"  in  "friendly  country 
ships".  An  algorithm  for  noun  compounds  in  IRUS-D  tries  to  break  up  a  sequence  of  nouns  into  component  NPs. 

Other  modifiers  of  nouns,  whether  prepositional  phrases  PP,  adjectives  ADJ.  past  participles  PASTPART  (e  g  . 
"the  crossed  nver  ")  or  present  participles  PRESP.ART  (e  g.,  "the  deploying  commander"),  are  normally  handled  by 
PP  rules,  adjective  rules  or  verb  clause  rules,  respectively  .  The  only  reasons  to  put  tests  for  these  modifiers  directly 
in  an  NP  IRule  are  ( 1  >  to  handle  idiomatic  readings  (  the  red  (Russian)  forces"),  or  (2)  because  the  modifier  never 
occurs  in  predicate  position,  like  "of  the  ship"  in  captain  of  the  ship"  but  not  (*  ("captain  is  of  the  ship".  Only  if  the 
meaning  of  a  modifier  is  different  when  it  is  in  a  noun  phrase  from  when  it  is  a  predicate  modifier  should  a  test  for 
this  modifier  be  put  directly  in  the  NP  rule.  Otherwise  it  need  not  be  mentioned  in  the  NP  [Rule.  For  example,  the 
UNIT.BE. PREDICATE.  I  PP  IRule  handles  both  List  the  ships  in  port"  and  "List  the  ships  that  are  in  port 
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The  PP  is  as  for  CLAUSE  Irules 

It  should  he  noted  that  .V,  ADJ.  PRESPART.  PASTPART,  and  PRENOMISAL  test  for  uninterpreted 
constituents.  An  uninterpreted  constituent  does  not  carry  a  semantic  interpretation  with  it.  Thus  if  an  uninterpreted 
constituent  is  bound  to  a  variable  in  an  IRule  and  used  in  constructing  a  WML  predicate,  the  lexical  head  itself  is 
what  will  appear  in  the  WML  in  place  of  the  IRule  variable.  In  the  case  of  interpreted  constituents,  however,  a 
term-a  IOTA  expression  or  a  variable  bound  by  an  outer  scoping  expression  which  is  the  interpretation  of  the 
transmitted  constituent-is  what  is  replaced  for  the  IRule  variable.  PRENOM1NAL  is  a  super-label;  it  accepts  Ns. 
ADJs.  PRESPARTs,  and  PASTPARTs 

4.1.3  Adjective  IRules 

Adjective  clause  IRules  are  triggered  when  an  adjective  is  used  in  predicate  position  (e.g..  "the  girl  is  pretty ", 
"the  girl  that  is  pretty"),  where  the  adjective  (e.g..  pretty)  is  considered  the  head  of  the  clause.  Also  when  an 
adjective  appears  directly  modifying  a  noun  (e.g..  "the  pretty  girl"),  the  semantic  interpreter  tries  to  use  the  general 
meaning  of  the  adjective  encoded  in  the  appropriate  adjective  rule  to  obtain  an  interpretation,  by  interpreting  it  as  a 
predicate  adjective  in  a  relative  clause  ("the  girl  that  is  pretty  ").  (In  this  NP  case,  die  interpreter  will  first  try  an 
idiomatic  interpretation  by  checking  the  head's  NP  IRule  for  an  ADJ  test,  and  will  keep  this  result  as  a  first  option  in 
the  list  of  possible  NP  interpretations.) 

The  following  is  a  template  for  the  pattern  section  of  an  adjective  clause  IRule: 

(CLAUSE 

HEAD  * 

SUBJECT  <test> 

(PP  ( (PP  HEAD  <Preposition-test> 

POBJ  <te»t>) +)  ) 

(THATCOMP  <test>) 

(<other-verb-inodifier-label>  <test>)  * 

) 

HEAD  is  the  adjective,  which  is  considered  head  of  the  clause  when  it  appears  in  predicate  position. 

SUBJECT  tests  the  NP  being  modified  by  the  adjective. 

PP  and  other  labels  are  as  for  verb  clause  IRules.  They  arc  present  when  the  ADJ  takes  complements  For 
example,  "afraid"  takes  a  THATCOMP  in  "I  am  afraid  that  I  will  fail"  and  mission  area  ratings  take  PP 
complements,  as  in  "Frederick  is  Ml  on  ASW" 
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4.1.4  Prepositional  Phrase  IRules 

Prepositional  phrase  clause  IRules  are  triggered  when  a  PP  is  used  in  predicate  position,  e  g.,  "the  ship  is  in 
Hawaii",  "the  ship  that  is  in  Hawaii".  An  attempt  is  made  to  handle  PPs  analogously  to  ADJs,  since  they  both  occur 
in  predicate  position  as  well  as  directly  in  noun  phrases  modifying  the  head  NP,  but  both  syntactic  positions  yield 
the  same  meaning.  Thus,  as  in  the  ADJ  case,  when  a  PP  appears  directly  modifying  a  noun  (e.g.,  "the  ship  in 
Hawaii"),  the  semantic  interpreter  tries  to  use  the  appropriate  PP  rule  to  obtain  an  interpretation  by  interpreting  it  as 
a  predicate  PP  in  a  relative  clause  after  first  trying  an  idiomatic  interpretation.  If  the  KNACQ  system  is  in  use.  the 
interpretation  of  a  direct  PP  modifier  as  a  relative  clause  PP  does  not  happen  with  KNACQ,  separate  PP  rules  need 
not  be  defined,  it  handles  the  PPs  automatically  when  an  attribute  or  case  rule  is  defined 

A  difference  from  the  ADJ  treatment  is  that  the  parser  does  not  consider  the  PP  to  be  the  head  of  the  phrase 
when  in  predicate  position,  but  insists  to  have  "BF"  as  the  head.  Therefore  PP  IRules  must  be  tnggered  by  "BE". 
Currently,  PP  IRules  must  be  all  inherited  by  the  default  IRule  NP. BE. PREDICATE.  We  plan  to  change  the  way 
PPs  are  treated  to  be  more  analogous  to  ADJs. 

Following  is  a  schema  for  the  pattern  portion  of  PP  clause  IRules: 

(CLAUSE 


HEAD  BE 

SUBJECT  <test> 

PREDICATE  ( (PP  HEAD  <Preposition-test> 

POBJ  <test>)+) 

) 

HEAD  must  be  "BE". 

SUBJECT  tests  the  NP  being  modified  b\  the  prepositional  phrase. 

PREDICATE  has  the  syntax  of  PP  in  the  other  IRule  types.  It  tests  for  the  PPs  that  should  trigger  this  IRule. 
More  than  one  PP  may  be  listed  in  the  same  IRule.  as  long  as  the  seiechonal  restriction  on  what  they  modify  (the 
SUBJECT )  is  the  same 
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4.1.5  Selectional  Restriction  Tests 


A  test  associated  with  a  label  in  an  active  IRule  is  applied  to  any  constituent  transmitted  with  that  label  If  the 
constituent  fails  the  test,  that  particular  parse  path  is  aborted  by  the  parser,  and  another  path  is  tned.  If  there  is  no 
test  for  the  transmitted  label,  inherited  IRules  will  be  tned  to  accommodate  the  transmitted  constituent.  If  no 
inherited  IRule  accepts  it.  it  is  rejected.  When  a  transmitted  constituent  is  accepted,  it  is  saved  to  be  used  by  the 
action  portion  of  the  IRule  w  hen  the  current  phrase  is  finished 

An  IRule  label  may  be  a  unique  item  label  if  only  one  constituent  may  fill  that  slot  (e.g.,  SUBJECT),  or  it  may¬ 
be  a  multiple  item  label  if  it  can  accept  more  than  one  filler  (e  g..  PP.  PRENOMINAL).  When  the  label  is  a  unique 
item  label,  it  is  associated  with  a  single  test  ir.  the  IRule.  else  a  list  of  tests  is  used,  indicating  the  different  possible 
fillers. 

A  test  associated  with  an  IRule  label  is  either  a  base  test,  or  a  path  specification  containing  base  tests  in  it. 
For  example, 

(PP  HEAD  (ONE-OF  OF  FROM)  POBJ  (IS-A  COUNTRY) ) ) 
is  a  path  specification  with  two  base  tests:  (ONE-OF  OF  FROM)  and  (IS-A  COUNTRY),  testing  for  the  PP  head 
and  its  POBJ,  respectively.  Using  general  path  specifications,  except  for  the  usual  PP  schema  as  above,  is  not 
recommended. 

There  are  several  types  of  base  tests  that  can  be  associated  w  ith  labels  in  an  IRule. 

•  Simple  semantic  test:  Normally  of  the  form  (IS-A  <domain-concept-name>).  A  constituent  passes  this 
test  if  its  semantic  class  is  subsumed  by  "  subsumes  the  concept  denoted  by  <dom?an-concept- 
name>  This  IS-A  is  really  a  two-way  subsumption  test  in  order  to  allow  more  general  concepts  to  fill 
the  semantic  constraint,  as  is  necessary'  for  pronouns,  for  example.  In  "He  deployed  the  ship",  "he"  hxs 
semantic  class  "PERSON",  a  superconcepi  of  "COMMANDER",  which  the  verb  "deployment"  requires 
as  subject.  IS-A-SUPERC  instead  of  IS-A  may  be  used  if  strict  one-way  subsumption  is  required. 

•  Exact  match:  A  lest  for  an  exact  match  w  ith  a  constituent  may  be  done  in  one  of  three  ways: 

•  (EXACTLY  <word>)  will  test  whether  the  root  of  the  head  of  the  constituent  is  <word> 

•  <word>  will  test  whether  the  unimerpreteJ  constituent  is  equal  to  <word>.  Used  to  test  the 
preposition  head  of  a  PP,  for  example. 

•  (ONE-OF  <wordl>  <word2>..  <wordn>i  is  like  <word>.  hut  is  used  when  more  than  one  option 
is  acceptable. 

•  Arbitrary  test:  of  the  form  (TEST  <aibitrary-test>).  <Arbitrary  test>  may  be  a  simple  boolean 
combination  (using  NOT.  AND,  OR)  of  simple  IS- A  tests,  or  may  be  a  function  call  to  an  arbitrary 
function.  Use  of  arbitrary  functions  is  not  recommended. 

•  Automatic  success,  if  the  constituent  should  be  accepted  regardless.  T  or  (TEST  T)  may  be  used. 
Another  test  which  immediately  succeeds  is  *.  but  this  should  only  be  used  for  IRule  heads,  because 
constituents  approved  via  *  cannot  be  replaced  by  pronouns 

•  Forced  failure  and  block:  to  cause  immediate  failure  when  a  constituent  is  present,  and  prevent 
inhented  rules  to  be  tned  alter  the  failure,  'TEST  NILi  may  be  used 
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4.2  Action  Section 

The  primary  function  of  the  action  section  of  an  IRule  is  to  contribute  to  the  production  of  the  WML  that  will 
represent  the  phrase  which  triggered  the  IRule.  Another  function  is  to  specify  any  modifiers  that  may  be  required 
(not  optional)  for  the  IRule  to  fire. 

The  entire  action  section  of  an  IRule  is  a  macro  call.  In  a  clause  IRule,  it  is  a  call  to  the  macro 
BINDQUANTS:  in  an  NP  IRule  it  is  a  call  to  the  macro  LIFTQUANTS,  These  macro  names  are  meant  to  be 
mnemonic:  quantified  expressions  with  possible  "holes'  in  them  (to  be  filled  by  other  parts  of  the  sentence) 
stemming  from  NPs  are  earned  ("lifted")  until  a  clause  boundary  is  reached,  at  which  point  they  are  "bound" 
together  with  the  clause's  contnbution.  i.e.,  all  the  holes  are  filled  and  the  quantification  structure  of  the  clause 
interpretation  is  determined.  These  macros  take  as  the  first  argument  a  variable  binding  list  The  other  arguments 
(the  body  of  the  action)  are  actions  to  be  performed  using  the  bound  variables. 

4.2.1  V  ariable  Binding  List 

The  first  argument  to  LIFTQUANTS  and  BINDQUANTS  in  the  action  part  of  an  IRule  is  a  variable  binding 
list.  Variable  names  are  bound  to  fillers  of  constituent  labels  mentioned  in  the  left  hand  side  of  the  IRule. 

The  syntax  of  the  bindings  list  is 

( (<varl>  <constituer.t-de script  ionl>) 

(<var2>  <constituent-description2>) 

(<varn>  <constituent-descriptionn>) ) 

Where  the  syntax  of  a  <consutuent-descnption>  may  be  given  by  the  following  context  free  definition,  where 
I  signifies  "or '.  (  ]  is  for  optional  items,  and  a  superscript  of  +  means  one  or  more  repetitions: 

constituent-description  ->  cl  |  (OPTIONAL  cl)  |  (DEFAULT  cl  <default>) 

|  (LISTOF  Cl) 
cl  ->  (OR  c2+)  |  c2  |  c3 

c2  ->  c3  |  (PROPERTY  <c3>  <semantic-prop>)  |  (FULL-INTERP  <c3>) 
c3  ->  (or  <path>+)  |  path 

OPTIONAL  means  that  even  if  the  specified  label  has  not  been  filled,  the  IRule  execution  is  still  allowed  to 
complete.  The  variable  is  assigned  value  “ABSENT**  and  any  action  in  the  body  using  the  variable  is  ignored 

DEFAULT  allows  for  a  default  value  <default>  to  be  used  as  the  variable  value  if  the  specified  label  is  not 
filled  OPTIONAL  and  DEFAULT  allow  for  a  label  filler  to  be  missing  If  neither  of  these  is  used,  the  filler  is 
considered  required  If  a  required  filler  is  not  present,  the  IRule  execution  fails,  i.e..  an  interpretation  using  this 
IRule  is  no  longer  considered  (If  it  was  the  only  IRule.  the  constituent  "pop"— finishing  the  phrase-will  fail,  and 
the  parser  will  have  to  try  another  parse  path.  >  LIST*  >F  obtains  a  hrt  of  all  possible  fillers  of  the  description 
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OR  states  thai  any  of  the  specified  labels  may  be  used  to  bind  the  variable.  The  first  label  from  the  list  that 
has  a  filler  is  used  (except  in  the  context  ot  a  L1STOF,  where  all  are  used). 

<path>,  in  its  simplest  form,  is  just  a  label,  e.g.  SUBJECT.  It  may  be  a  more  complex  path,  as  is  the  cxse  with 
muluple  fillers  for  a  label  such  as  PRENOMINAL.  In  this  case  a  path  would  be  (PRENOMINAL  <n>),  where  <n> 
is  an  integer,  indicating  the  nth  PRENOMINAL  test  from  the  left  hand  side  of  the  IRule.  Complex  paths  for  PPs  are 
typical,  e.g.  (PP  1  POBJ)  meaning  the  object  of  the  first  PP. 

PROPERTY  takes  a  label  or  path  and  a  domain  model  object  name  (a  concept  or  role).  The  semantics  field  (in 
the  lexicon)  for  the  head  of  the  label  filler  is  searched,  and  if  the  object  name  is  in  the  semantics,  the  variable  is 
bound  to  that  semantic  property’s  value  (usually  *,  or  something  complex  like  (NAMEOF  "FREDERICK")  or 
(CODE  6A)i.  A  variant  of  this  is  using  (ANY  <semantic-prop>)  instead  of  <semantic-prop>.  When  ANY  is  used, 
any  superconcept  (or  superrole)  of  <semantic-prop>  (including  <semantic-prop>  itself)  is  used  if  it  is  found  in  the 
head's  semantics. 

FULL-INTERP  is  used  when  one  wants  to  bind  the  variable  to  the  whole  WML  interpretation  of  the  label 
filler.  FULL-INTERP  should  only  be  used  for  intensional  arguments.  For  example  the  CHANGE  ERule  uses 
(FULL-INTERP  SUBJECT )  to  bind  an  IRule  vanable  to  the  subject,  in  order  to  apply  the  INTENSION  operator  to 
it  in  its  interpretation.  Normally  the  IRule  variables  get  bound  to  a  IOTA  term  (in  the  definite  NP  case),  to  a  SEFOF 
expression  (in  the  case  of  an  NP  conjunction),  or  to  a  variable  that  is  bound  by  the  scoping  WML  expression 
generated  for  the  label  filler.  FULL-INTERP  causes  all  partial  WML  pieces  that  were  being  ’  lifted''  to  be  "bound" 
into  a  closed  expression  This  closed  expression  is  made  the  value  of  the  IRule  vanable 

4.2.2  IRule  Body 

The  body  of  the  action  part  of  the  IRule  is  composed  of  individual  actions  (function  calls)  that  add  bits  of 
WML  to  the  interpretation  of  the  current  phrase  that  is  being  formed.  Normally  one  scoping  expression  is  generated 
per  NT  or  clause.  The  primary  (unction  of  the  actions  in  the  body  is  to  specify-  the  sort  (semantic  class)  of  the 
variable  to  be  used  in  the  scoping  expression  and  to  specify  predicates  that  will  be  gathered  for  the  scoping 
expression’s  body. 

Typical  actions  that  may  be  found  in  IRules: 

( ANDPREDICATE  ’(<rolename>  <varl>  <var2>>)  :  Adds  the  given  predication  to  the  conjuncts  that  are 
being  gathered  for  the  expression's  body.  *V*  is  used  to  refer  to  the  local  variable  of  the  scoping  expression  under 
construction  ANDPRED1CATE  is  used  in  NP  IRules. 

(SEM  ’(<rolename>  <\arl>  <var2>))  analogous  to  ANDPREDICATE.  SEM  is  used  for  CLAUSE  IRules. 

(CLASS  <coneeptname>)  :  specifies  (he  semantic  son  to  be  used  in  the  quantified  expression. 
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(LOCALQUANT  '<det>)  :  Where  <det>  may  be  SOME  or  THE.  which  makes  the  branch  category  of  the 
scoping  expression  be  EXISTS  or  IOTA,  respectively.  For  NP  rules.  LOCALQUANT  is  not  necessary,  since  the 
branch  category  is  determined  by  the  determmer  used  in  the  noun  phrase.  For  CLAUSE  IRules.  this  should  be 
SOME  if  a  quantified  expression  is  desired  for  the  CLAUSE.  If  only  predications  are  desired.  SEMs  may  be  used 
alone,  or  (NOLOCALQUANT)  may  be  used.  (We  will  probably  remove  the  need  to  use  LOCALQUANT  at  all 
soon.) 

(IF-BOUND 

(<varl>  <ac tionl  1>  <actionl2>...) 

(<varn>  <actionnl>  <aetionn2>...) 

(otherwise  <actionl>  <action2>...)) 

IF-BOUND  is  analogous  to  a  LISP  CASE  statement.  It  executes  the  actions  in  the  first  branch  that  names  a 
bound  [Rule  variable. 

The  following  are  two  utility  functions  that  may  be  used  as  part  of  an  IRule  action 

(GET-IS-A-SEMPROP  ’<word>  ’<domain  objectname>)  :  This  function  searches  the  semantics  field  of 
<word>  in  the  lexicon,  try  ing  to  find  a  semantic  property  (a  "sense")  of  the  word  which  names  a  domain  object  equal 
to  or  subsuming  the  object  named  by  <objectname>.  It  returns  the  word  sense  that  matched.  CET-IS-A-SEMPROP 
is  usually  used  to  find  the  most  specific  semanuc  property  of  <word>,  which  is  know  n  to  be  a  subconcept  or  subrole 
of  <domain  objectx 

(MAKE-IRULE-SCOPING-F.XPRESSION  '<predicationl>  ...  '<predicationn> 

:class  <sort>  :branch-cat  <branch-category>) 

If  more  than  one  quantified  expression  is  absolutely  required  to  be  generated  for  one  IRule,  MAKE-IRULE- 
SCOPING-EXPRESSION  creates  one  that  may  be  used  as  an  argument  to  an  ANDPREDICATE  or  SEM.  or  as  part 
of  another  predication.  <Predication>  may  be  in  terms  of  a  special  variable  *E*  which  refers  10  the  local  variable  to 
be  created  for  that  expression. 

4.2.3  IRule  Examples 

This  section  provides  an  analysis  of  tw  o  IRules:  one  is  an  example  of  an  NP  IRule  and  the  other  a  clause 

IRule. 

4.2.3. 1  An  NP  IRule 


jo 
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(IRule 

(IRule  BATTLE . GROUP . 1 
(NP  HEAD  * 

PP  ((PP  HEAD  OF  POBJ  (IS-A  UNIT)) 

(PP  HEAD  IN  POBJ  (IS-A  LOCATION . IN . SPACE) )) ) 

(LIFTQUANTS  ((UNIT.l  (OPTIONAL  (PP  1  POBJ))) 

(PLACE. 1  (OPTIONAL  (IP  2  POBJ)))) 

(ANDPREDICATE  '(IN. PLACE  *V*  PLACE. 1)) 

(ANDPREDICATE  '(UNIT. GROUP  UNIT.l  *V*)  ) 

(CLASS  BATTLE . GROUP ) 

SUPERRULE  T  CONCEPT -I Rule -ATTACHED -TO  BATTLE . GROUP )) ) 

The  head  of  a  noun  phrase  handled  by  this  [Rule  can  be  modified  by  either  or  both  of  two  types  of 
prepositional  phrase.  The  pattern  for  PP  1  accepts  a  prepositional  phrase  w  ith  the  preposition  "of  and  an  object  that 
is  a  "UNIT”.  PP  2  accepts  a  prepositional  phrase  whose  preposition  is  "in'  and  whose  object  is  a 
"LOCATION. IN. SPACE '.  UNIT  and  LOCATION. IN. SPACE  are  concepts  from  the  domain  model. 

If  a  constituent  matches  this  pattern,  then  the  action  section  will  be  executed.  The  action  section  binds  the 
optional  object  of  PP  I  to  the  variable  UNIT.  1 .  It  also  binds  the  optional  object  of  PP  2  to  the  variable  PLACE  1 . 

Two  ANDPREDICATE  functions  then  add  predications  to  .  used  in  the  body  of  the  scoping  expression  to  be 
created  *V*  is  a  variable  which  is  automatically  bound  to  the  local  variable  that  is  created  for  this  expression 

(CLASS  BATTLE. GROUP)  sets  the  sort  field  of  the  expression,  and  labels  this  constituent  as  having  class 
BATTLE. GROUP. 

If  the  input  NP  is  "the  battle  group  in  the  Indian  Ocean",  the  determiner  "the"  w  ould  result  in  a  IOTA  branch 
category,  and  the  partial  WML  interpretation  w'ould  be: 

(IOTA  ?JX1  BATTLE. GROUP 

(IN. PLACE  ?JX1  <"Indian  Ocean"  interp>) ) 

Since  there  was  no  "OF  PP.  the  UNIT.  1  IRule  variable  was  not  bound  and  the  ANDPREDICATE  using  it  was 

ignored. 

4. 2. 3. 2  A  Clause  IRule 

(IRule 

(IRule  DEPLOYMENT 

(CLAUSE  SUBJECT  (IS-A  COMMANDING . OFFICER)  HEAD  *  OBJECT  (IS-A  UNIT) 

PP  (  (PP  HEAD  TO  POBJ  (IS-A  LOCATION . IN . SPACE) )) ) 

(BINDQUANTS  ((CO.l  (OPTIONAL  SUBJECT))  (UNIT.l  (OPTIONAL  OBJECT)) 

(TO . 1  (OPTIONAL  (PP  1  POBJ)))) 

(LOCALQUANT  'SOME) 

(CLASS  DEPLOYMENT) 

(SEM  '(AGENT  *V*  CO.l)) 

(SEM  ' (OBJECT  *V*  UNIT.l)) 

(SEM  ' (DEPLOYED .TO  *V*  TO.l)))) 


31 


BBN  System!*  and  Technologies  Corporation 


Report  No.  7144 


The  head  of  the  clause  handled  by  this  IRule  must  have  a  subject  which  is  a  "COMMANDING. OFFICER".  If 
it  takes  a  direct  object,  it  must  be  an  object  which  is  a  'UNIT".  The  verb  mas  also  be  modified  by  a  prepositional 
phrase  If  it  is,  the  preposition  must  be  "to"  and  the  object  of  the  preposition  must  be  a  "LOCATION  EN. SPACE 
COMMANDING. OFFICER.  UNIT,  and  LOCATION  IN. SPACE  are  concepts  from  the  domain  model. 

If  a  constituent  matches  this  pattern,  then  the  action  section  is  executed  Because  this  is  a  clause  IRule.  the 
action  section  is  a  BINDQUANTS  macro.  The  action  section  binds  the  subject  to  the  vanable  ACTIVE.ENTITY.L 
It  also  binds  the  direct  object  to  the  variable  UNIT.  1  and  the  object  of  the  preposition  to  the  variable  TO  1 . 


If  the  input  was  "An  admiral  deployed  Frederick  to  the  Indian  Ocean",  the  piece  of  WML  built  by  this  [Rule 
would  be: 

(EXISTS  ?JX1  DEPLOYMENT 

(AGENT  ?JX1  <"an  admiral”  interp>) 

(OBJECT  ?JX1  <” Frederick"  interp>) 

(DEPLOYED. TO  ?JX1  <" Indian  Ocean”  interp">) ) 


4.3  Creating  IRulcs 


Developers  who  are  building  [Rules  for  a  new  domain  or  extending  the  IRules  in  a  given  domain  may  use  the 
IRules  Acquisition  interface  (IRACQi  (best  suited  for  defining  verbs).  KNACQ  (chapter  7i.  or  wnte  the  rules 
directly  into  an  editor. 

Creating  [Rules  involves  constructing  a  case  frame  pattern  section  for  a  query  constituent  and  a  template  to 
build  a  WAIL  representation  of  the  query  taction  section)  Since  the  WAIL  representation  of  the  query  uses  terms 
from  the  domain  model,  the  IRule  developer  must  understand  the  domain  model  for  a  particular  domain  before 
writing  IRules  for  that  domain 

4.3.1  IsinulRACQ 

IRACQ  is  a  tool  that  a  developer  can  use  to  add  new  IRules  to  a  system.  An  alternative  to  IRACQ  is  to 
directly  enter  and  edit  an  IRule  in  an  editor  The  advantage  of  IRACQ  is  that  it  prompts  the  developer  and  leads  him 
or  her  through  an  interactive  session  that  makes  the  creation  of  new  IRules  a  relatively  easy  task  IRules  are  defined 
by  the  user  using  sample  phrases:  sample  noun  phrases  for  NT  IRules.  sample  sentences  for  verb  clause  IRules 
(with  the  verb  as  head ».  sample  sentence'  with  a  predicate  adjective  iwith  adjective  as  head)  for  ADJ  clause  IRules. 
and  sample  sentences  with  predicate  PPs  (with  BE"  as  head)  for  PP  clause  IRules.  It  is  possible  that  the  developer 
mas  wish  to  create  a  complex  IRule  that  would  not  be  supported  via  IRACQ  Even  when  the  developer  chooses  to 
use  IRACQ.  the  option  <>!  editing  the  resulting  IRule  is  provided. 
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4.3.2  An  IRACQ  session 

In  this  section  we  step  through  the  definition  of  a  clause  IRule  for  the  word  send  We  assume  that  levied 
information  about  "send  has  already  been  entered. 

The  user  enters  the  word  that  he  wishes  to  create  an  IRule  for.  the  name  of  the  domain  model  concept  he' 
wishes  to  connect  the  IRule  to.  and  a  sentence  using  the  new  word.  From  the  example  presented  by  the  user. 
IRACQ  must  learn  which  unary  and  binary  predicates  are  to  be  used  to  obtain  a  representation  Furthermore, 
IRACQ  must  acquire  the  most  general  semantic  class  to  which  the  fillers  of  the  slots  must  belong 

Output  from  the  system  is  shown  in  bold  face,  input  from  the  user  in  regular  face,  and  comments  are  inserted 
in  parentheses 

Word  that  should  trigger  this  IRule:  send 
Domain  model  term  to  connect  IRule  to  deployment 

Enter  an  example  sentence  using  "send": 

An  admiral  sent  Enterprise  to  die  Indian  Ocean 

Choose  a  generalization  for  COM MANDI.NG. OFFICER 

COMMANDING. OFFICER  PERSON  CONSCIOl  S.BF.ING  ACTIVE. ENTITY  OBJECT  THING 

(The  user's  selection  specifies  the  case  frame  constraint  on  the  logical  subject  of  send  The  user  picks 
COMMANDING. OFFICER  IRACQ  will  perform  similar  inferences  and  present  a  menu  for  the  other  cases  in  the 
example  phrase  as  well,  asking  each  time  whether  the  modifier  is  required  or  optional.  Assume  that  the  user  selects 
UNIT  as  the  logical  object  and  REGION  as  the  object  of  the  preposition  "to",  i 

Which  of  the  following  predicates  should  relate  DEPLOYMENT  to  REGION  in  the  \\  ML?: 

LOCATION. OE’  DESTINATION. OF 

(IRACQ  presents  a  menu  of  binary  predicates  relating  DEPLOYAIENT  and  COMMANDING. OFFICER,  anil 
one  relating  DEPLOYMENT  and  UNIT  The  user  picks  AGENT  and  OBJECT,  respectively  ) 

Enter  examples  using  "send"  or  <CR>  if  done: 


(The  user  may  provide  more  examples  Redundant  information  would  be  recognized  automatically  • 

Should  this  IRule  inherit  from  higher  IRuIex?  vex 


This  is  the  IRule  you  just  defined: 
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(IRule  DEPLOYMENTS 

(clause  subject  (is-a  COMMANDING. OFFICER) 

head  * 

object  (is-a  UNIT) 

pp((pp  head  to  pobj  lis-a  REGION;))) 

==> 

(bindquants  ((commanding.officer.l  (optional  subject); 
(unit.l  (optional  object)) 

(region.l  (optional  (pp  1  pobj)))) 

(andpredicate  '(destination.of  *v*  region.  I )) 
(andpredicate  ’(object.of  *v*  unit.l )) 

(andpredicate  ’(agent  *v* commanding.officer.l); 
(localquant  ’SOME) 

(class  ’DEPLOYMENT))) 


Do  you  wish  to  edit  the  IRule?  no 

(The  person  may.  tor  example,  want  to  lasert  something  in  the  action  part  of  the  IRule  that  w  as  not  covered  by 
the  IRACQ  questions. ) 

The  sense  of  sending"  we  have  defined  specifies  an  event  type  whose  sorted  lambda  representation  is  as 
follows: 

(<  Xx  deployment) 
a^entix.  at  <&  object! x.  o>  & 
destination! x,  dn 

where  the  agent  j  must  be  a  commanding  officer,  the  object  o  must  be  a  unit  and  the  destination  d  must  be  a 
region. 

This  concludes  our  sample  IRACQ  session. 


4.3.3  Debugging  environment 

The  facility  for  creating  and  extending  LRules  is  integrated  with  the  1RUS  NLI  itself,  so  that  debugging  can 
commence  as  soon  as  an  addition  is  made  using  IRACQ  The  debugging  facility  allows  one  to  request  1RUS  to 
process  any  input  sentence  un  one  of  several  modes:  asking  the  underlying  system  to  fulfill  the  user  request, 
generating  code  for  the  underlying  system,  generating  the  semantic  representation  only,  or  parsing  without  the  use  of 
semantics  (on  the  chance  that  a  grammatical  or  lexical  bug  prevents  the  input  from  being  parsed).  Intermediate 
stages  of  the  translation  are  automatically  stored  for  later  inspection,  editing,  or  reuse. 

IRACQ  is  also  integrated  with  the  other  acquisition  facilities  available.  As  the  example  session  above 
illustrates.  IRACQ  is  integrated  with  K.REME.  a  knowledge  representation  editing  environment.  Additionally,  the 
IRACQ  user  can  access  a  dictionary  package  for  acquiring  and  maintaining  both  lexical  and  morphological 
information. 
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Such  a  thoroughly  integrated  set  of  tools  has  proven  not  only  pleasant  but  also  highly  productive 

4.3.4  Editing  an  IRule 

If  the  user  later  wants  to  make  changes  to  an  IRuie.  he/she  may  directly  edit  it.  Currently,  there  are  no  checks 
performed  on  the  edited  [Rule  to  ensure  that  it  is  consistent  with  the  domain  model. 
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5.  The  IRUS  Dictionary 


5.1  Introduction 


The  dictionary  interface  allows  a  user  to  create,  edit,  and  save  dictionary  definitions  for  the  IRUS  system 
without  having  to  know  the  details  of  how  the  definitions  are  represented.  However,  the  user  must  have  some 
understanding  of  the  category  and  feature  system  of  IRUS. 

Categories  describe  the  general  syntactic  behavior  of  a  particular  lexical  item.  In  addition,  the  categones  ADJ. 
,V.  PRO,  POSSPRO.  and  V  may  have  a  value  that  descnbes  the  way  the  lexical  item  is  inflected 

The  following  dictionary  entries  provide  some  idea  of  the  intlectional  information  (and  other  information)  that 
different  categones  can  have: 

Verbs 

[BLOW  FEATURES  ( INDOB J  I NT  RAN S  PASSIVE  TRANS) 

IMMOVABLEPARTICLES  ( (IN  BLOW-IN) ) 

N  -S 

V  ( (PNCODE  X3SG) 

(TNS  PRESENT) ) ] 

[BLEW  V  (BLOW  (PNCODE  ANY) 

(TNS  PAST)  )  ] 

[BLOWN  V  (BLOW  (PASTPART) ) ] 

Nouns 

[MAN  FEATURES  (PASSIVE  TRANS) 

N  IRR 

V  S-*ED] 

[MEN  N  (MAN  (NUMBER  PL))] 

Adjectives 

[GOOD  FEATURES  (FORTOCOMP  SUBJCOMP  SUBJHOLD  THATCOMF) 

ADJ  IRR 

CASEPREPS  ( (FOR)  (WITH)  (AT) ) 

N  -S] 

[BETTER  ADJ  (GOOD  (COMPARATIVE) ) ] 

[BEST  FEATURES  (PASSIVE  TRANS) 

ADJ  (GOOD  (SUPERLATIVE)) 

V  S-ED] 
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PRO 

[I  FEATURES  (NOPOSTMODIFIERS) 

PRO  (I  (SUBJ) 

(NUMBER  SG)  ) 

SEMANTICS  (PERSON  *) ] 

[ME  FEATURES  (NOPOSTMODIFIERS) 

COMMENTS 

Maine  abbreviation  -donb,  5/30/86 
SUBSTITUTE  ( (MAINE) ) 

PRO  (I  (OBJ) 

(NUMBER  SG) ) 

[YOU  PRO  (YOU  (SUBJ) 

(OBJ) 

(NUMBER  SG/PL) ) ] 

POSSPRO 

[MY  POSSPRO  (I  (POSS) ) ] 

Features  describe  the  syntactic  configurations  in  which  a  given  lexical  item  can  appear.  For  example,  the 
feature  INDOBJFOR  indicates  that  a  verb  may  be  followed  by  an  indirect  object  introduced  by  for.  Properties  are 
Idee  features  in  that  they  restrict  the  distnbution  of  particular  lexical  items:  however,  unlike  features,  they  can  take 
values.  For  example  the  propeny  UNIT,  which  appears  on  measure  nouns,  such  as  foot,  can  have  a  value  that 
inc'icates  the  dimension  being  measured,  such  as  LENGTH: 

[FOOT  FEATURES  (TRANS) 

N  IRR 

UNIT  (FOOT  (UNIT  LENGTH) ) 

V  S-ED] 

[HOUR  N  -S 

UNIT  (HOUR  (UNIT  TIME))] 

Some  dictionary  entries  are  associated  with  items  that  may  appear  as  separate  words  in  input  but  which  should 
be  treated  as  single  units  by  the  parser,  such  as  lio w  Iona.  Such  items  are  entered  in  the  dictionary  as  single  entnes 
with  the  character  ~  joining  the  parts.  For  example.  HOW-LONG  is  the  dictionary  representation  for  how  lont;. 
Complex  verbs  wruch  consist  of  a  verb  and  a  particle  are  represented  in  the  same  fashion.  For  example,  SET ~ OFF 
is  the  dictionary  representation  of  set  off. 

Compounds 

[NEW  ADJ  ER-EST 
COMPOUNDS 

(NEW  JERSEY)  =>  NEW- JERSEY 
(NEW  YORK)  =>  NEW- YORK] 

[NEW- YORK  NPR  *] 
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[NEW~ JERSEY  NPR  *] 


Multiples 

Yerb+Partiele 


[BLOW-UP  FEATURES  (INTRANS  PASSIVE  TRANS) 
N  -S 

V  NOINFLECTIONS] 


A  dictionary  editing  tool  is  available  to  help  the  dictionary  developer  define  new  words  and  edit  existing  ones. 
The  developer  creates  the  dictionary  definition  of  a  word  by  selecting  menu  choices  and  typing  in  information  when 
prompted.  Another  entry  point  to  the  dictionary  interface  is  provided  when  the  user  attempts  to  use  a  word  in  a  query 
that  is  not  in  the  dictionary  .  The  system  asks  the  user  if  he  wishes  to  define  the  word  and  if  he  selects  yes.  the 
dictionary  interaction  begins 

The  dictionary  tool  is  a  window  based  tool  which  consists  of  a  large  interaction  window  and  two  menus 
containing  word  commands  and  dictionary  commands.  The  word  commands  are  Add  Word,  Print  Word,  Edit 
Word,  Delete  Word  These  are  discussed  below.  The  dictionary  commands  are  Save  Dictionary,  Make  Pretty 
Dictionary.  The  dictionary  commands  are  discussed  in  Section  5.6. 


5.2  Add  Word 


When  the  dictionary  developer  selects  the  Add  Word  command,  he  is  prompted  for  the  name  of  the  w  ord  he 
wishes  to  define.  He  is  then  presented  with  a  menu  containing  the  following  options: 


BREAK 

HELP 

DESCRIBE 

NONE 


will  place  you  in  a  LISP  break.  You  should  normally  never  use  this  option, 
will  print  the  choices  and  also  some  information  explaining  them. 

will  prompt  you  for  the  name  of  a  part  of  speech  and  will  print  out  a  brief  description  of  that  part 
of  speech. 

means  that  none  of  the  current  CATEGORIES  apply.  (There  is  a  category'  called  SPECIAL  that 
should  be  considered  instead  of  NONE,  if  you  want  to  make  that  distinction.,) 


NEW-POS  allows  you  to  create  a  new  part  of  speech  not  already  on  the  CATEGORIES  list.  You  would 

not  normally  use  this  option. 


LIKE-OTHER-WORD 


allows  you  to  enter  a  second  word.  The  word  being  defined  receives  a  copy  of  the  definition  of 
that  word.  This  is  useful  even  if  the  definitions  aren't  exactly  identical,  since  it  might  be  easier 
to  edit  the  copy  than  insert  the  complete  definition  from  scratch. 


<part  of  speech>  The  other  possibilities  arc  the  parts  of  speech  stored  in  the  variable  CATEGORIES. 

For  some  parts  of  speech  (e  g  .  SPECIAL.  CONE .  NPR)  only  the  category  must  be  specified  For 
others  (e  g  .  N.  V,  ADJ i  a  specific  sequence  of  questions  prompts  for  further  information  about 
the  category.  These  options  are  described  below 

Though  the  parser  uses  many  categories,  many  of  them  are  "closed  classes"  which  you  w  ill  nol 
need  to  add  to  The  following  categories  are  the  only  ones  you  are  hkek  to  actual k  input  here: 

ADJ,  ADY.  Mount.  NPR.  V 
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Depending  on  the  category  which  the  user  selects,  the  system  presents  the  user  with  various  prompts  and  menus  to 
specify  the  characteristics  of  that  word. 


5.2.1  Nouns 


Nouns  are  all  uncapitalized  (i.e.  common}  words  which  name  a  person,  place,  or  thing. 

If  the  user  selects  to  define  a  NOUN,  the  system  displays  a  menu  asking  how  that  noun  is  made  plural.  The 
choices  are: 

S  for  nouns  like  word.  dog.  house,  mountain 

ES  for  nouns  like  box.  tax.  church 

1RR  for  nouns  that  form  their  plurals  in  an  irregular  ways,  such  as  ox: oxen,  child: children 

BOTH  FORMS  THE  SAME 
PLURAL  ALREADY 
NONE 

HELP  This  menu  choice  provides  the  user  with  information  on  the  possible  choices. 

After  the  user  has  made  his  selection,  another  menu  appears,  asking  whether  the  word  has  another  plural  form. 
If  it  does,  the  user  is  asked  to  select  it  from  a  menu. 

The  user  is  then  asked  whether  the  noun  is  non-countable.  The  HELP  choice  on  this  menu  will  provide  the 
user  with  information  on  the  meaning  of  this  question.  Non-countable  nouns  are  concrete  nouas  that  normally  do  not 
appear  with  a  number  (i.e.,  nouns  that  are  not  count  nouns)  and  which  can  appear  in  singular  form  without  a 
determiner.  Water  is  an  example  of  a  non-countable  noun.  This  feature  is  also  used  with  nouns  that  are  not 
concrete,  such  as  anger  and  freedom. 

The  user  is  then  given  an  opportunity  to  enter  a  comment  into  the  new  dicuonary  entry.  The  system  will 
automatically  create  a  comment  that  includes  the  date  and  the  user  name.  Any  comment  the  user  enters  will  be 
added  to  this 

The  user  is  then  asked  whether  there  are  any  substitution  phrases  involving  the  new  word.  The  menu  choices 
are  Yes.  No.  Help  and  Quit.  An  example  of  a  substitution  phrase  is  the  substitution  of  Jan  for  January . 
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5.2.2  Adjectives 

Adjectives  are  words  that  modify  nouns,  e.g.  good,  red,  anxious.  Words  that  are  past  or  present  participles  of 
verbs  and  might  be  used  to  modify  nouns  {e.g..  fallen,  singing)  should  not  be  classified  as  adjectives  in  the  dictionary 
because  the  grammar  will  take  care  of  them. 

When  a  word  is  classified  as  an  adjective,  you  must  specify  how  it  is  inflected  to  get  the  comparauve  and 
superlative  forms.  Some  ADJs  don’t  inflect  at  all  (e  g.  only ). 

Adjectives  may  take  features.  Those  given  only  with  examples  below  are  described  more  fully  in  the  section 
on  verbs.  The  features  PREDADJ,  SUBJHOLD.  PREDETERMINER  and  POSTNOMINAL  are  unique  to 
adjectives. 

PREDADJ  The  adjective  can  appear  only  in  predicate -adjective  position,  not  in  prenominal  position.  For 

example:  the  boy  is  afraid,  *  the  afraid  boy . 

PREDETERMINER 

The  adjective  can  occur  before  part  of  the  determiner  structure  (numbers  are  considered  part  of 
the  determiner  structure),  as  in  the  previous  (last,  biggest,  next,  most  recent  four...). 

POSTNOMINAL  The  adjective  can  occur  after  a  noun,  as  in  the  president  elect.  Some  of  the  words  in  this  class 
occur  only  after  a  noun,  others  can  also  occur  in  the  usual  prenominal  position. 

CASEPREPS  The  adjective  can  take  a  noun  phrase  complement  m  the  pattern  <adj><p><np>  The 
CASEPREPS  property  allows  the  preposition  to  be  specified  for  an  individual  adjective,  e  g 
she  is  afraid  of  him  (of).  John  is  faithful  to  Mary  (to),  they  are  too  reliant  on  force  (on),  etc. 

THATCOMP  as  in  he  is  afraid  that  he  will  fail 

INTRANSTOCOMP 

as  in  she  is  careful  to  appear  calm 

FORTOCOMP  as  in  it  is  difficult  for  him  to  hear 

SUB  J  LOW  as  in  he  is  eager  to  please  and  he  is  unlikely  to  come 

SUBJHOLD  Adjectives  which  are  marked  INTRANSTOCOMP  or  FORTOCOMP  may  also  have  this 
feature,  which  means  that  the  subject  is  held  as  the  object  of  the  complement  rather  than  lowered 
into  the  subject  position  of  the  complement.  (This  construction  is  often  called  tough  movement 
by  generative  linguists. )  For  example:  John  is  easy  to  please.  Jim  is  difficult  for  Mary  to  please 

SUBJCOMP  as  in  that  he  w  ill  come  is  unlikely 

5.2.3  Adverbs 

Adverbs  are  words  that  modify  verbs,  adjectives,  or  other  adverbs.  In  addition  to  this  part  of  speech,  an 
adverb  may  have  one  or  more  of  the  following  features:  PREDETADV,  NEGADV.  TRANSADV  and  ADJMOD 
PREDETADV  The  adverb  can  occur  before  a  noun  phrase.  (For  example,  quite  a  woman,  only  a  boy.) 

NEGADV  The  adverb  cannot  occur  with  a  negated  verb.  (For  example.  I  ate  the  cake,  I  barely  ate  the 

cake.  /  didn't  eat  the  cake.  *1  didn’t  hotels  cat  the  cake.)  In  addition,  adverbs  with  the  feature 
NEGADV  can  occur  in  the  sentence -initial  position  with  subject-verb  inversion,  as  in  Barely 
had  I  eaten.  . 

TRANSADV  The  adverb  while  still  modifying  a  verb,  can  have  an  ohiect  which  is  an  NP,  as  in  especialls  the 
fat  men. 
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ADJMOD  The  adverb  can  modify  adjectives,  as  in  almost  red ,  and  always  delightful. 

5.2.4  Verbs 

For  verbs,  vou  will  be  asked  if  the  word  vou  provided  is  the  root  form : 

Is  it  the  root  form? 

You  will  then  be  asked  for  the  type  of  verb.  The  verb  types  are  the  following: 

S-D  if  the  word  is  a  root  inflected  like  gauge. 

S-ED  if  the  word  is  a  root  inflected  like  learn. 

ES-ED  if  the  word  is  a  root  inflected  like  kiss. 

IRR  if  the  word  is  an  irregular  verb  Like  go,  be  or  see:  you  will  be  prompted  for  the  root  of  the  verb 

and  its  conjugation 

INFLECTED  if  the  word  is  already  inflected  dike  gives  or  had):  you  will  be  prompted  for  the  other  parts 

When  this  information  has  been  collected  about  a  verb,  information  about  the  features  of  the  verb  will  be 
gathered.  Even  verb  must  be  transitive  or  intransitive  (or  both);  transitive  verbs  may  or  may  not  take  a  direct  object 
and  may  or  may  not  be  made  passive.  Explicit  questions  are  asked  for  each  of  these  features.  If  the  verb  takes 
panicles,  they  may  be  either  regular  particles  or  immovable  particles.  Finally  the  system  will  ask  Does  it  take  any 
complements?  If  the  answer  is  y,  you  will  be  allowed  to  choose  one  or  more  features  from  a  list.  (You  can  see  the 
list  by  typing  a  ?.)  In  the  future,  more  structure  will  be  provided  here,  as  some  of  the  features  aren't  really 
complement  features  and  some  are  dependent  on  others,  but  for  now  you  can  choose  any  subset.  When  you  have 
finished  entering  features,  type  END  to  the  ">"  prompt. 

Verbs  are  the  most  complex  words  to  define  because  they  may  take  so  many  features. 

ASPECTYF.RB  This  is  used  for  verbs  that  take  infinitive  or  -mg  complements  (most  take  both,  some  take  only 
one)  and  that  have  an  aspectual  interpretation.  For  example,  begin  to  read,  begin  reading,  have 
to  leave,  slop  trying.  Verbs  marked  with  this  property  do  not  appear  as  verbs  in  the  parse 
structure.  Instead,  they  appear  as  features  of  the  Auxiliary. 

CASEPREPS  The  CASEPREPS  property  is  used  with  adjectives  and  verbs  that  take  (either  optionally  or 
obligatorily)  a  particular  preposition,  rather  than  prepositional  phrases  in  general.  The  value  is  a 
list  of  pairs,  the  first  element  being  the  preposition,  the  second  being  a  semantic  case  .An 
example  of  apossible  value  is  ((TO  RECIPIENT)  (FROM  DONORli 

ADJCOMP  Verbs  with  this  feature  can  take  adjectives  as  complements.  For  example,  stand  rail. 

TRANSADJCOMP 

Similar  to  ADJCOMP.  but  the  pattern  is  <subjectxvxnpxadj>.  For  example,  he  drives  me 
iracy  .  we  Jmd  him  guilty. 

immovableparticles 

These  are  verb-particle  pairs  in  which  either  there  ts  no  noun  phrase  object  or  the  particle  does 
not  move  after  the  object  i  as  in  he  washed  up  the  dishes,  he  washed  up.  *he  washed  tin  dishes 
up).  The  pattern  is  <subjxvxnarticlexnp>  or  <subjxv><particle>. 

INTRANS  Intransitive  verbs  can  appear  without  an  object  noun  phrase,  e.g  eat  and  die 

INTRANSTOCOMP 


42 


Report  No.  7144 


BBN  Systems  and  Technologies  Corporation 


These  verbs  (and  adjectives)  can  only  take  a  to  complement  without  a  preceding  NP,  as  in  agree 
to  buy.  forget  to  call,  threaten  to  do  it. 

A  verb  can  have  only  one  of  the  features  TOCOMP,  INTRANSTOCOMP.  and  NPTOCOMP. 

Transitive  verbs  can  appear  with  an  object  noun  phrase,  e.g.  eat.  give,  and  assist.  Verbs  which 
are  transitive  may  also  have  one  or  more  of  the  following  features: 

The  verb  can  appear  with  its  deep  structure  object  before  the  verb  and  its  deep  structure  subject 
in  a  oy  pudx.,  lu  :U  cr'  ,'by  the  Many  transitive  verbs  can  be  passivi?ed  but 

some  cannot  (e.g.  resemble ,  cost ,  weigh). 

The  verb  can  take  an  indirect  object,  which  appears  with  to  or  for.  (The  indirect  object  may  or 
may  not  be  required.)  Usually  there  is  a  form  without  the  preposition  in  which  the  indirect 
object  comes  before  the  direct  object.  For  example:  John  gave  the  book  to  him  and  John  gave 
him  the  book.  There  are  a  few  verbs  that  appear  only  in  the  dative  form  (with  to  or  for),  e.g. 
donate,  explain. 

INDOBJ&THATCOMP 

The  verb  can  take  a  that  complement  together  with  an  indirect  object.  For  example:  /  told  her 
that  he  lied.  Show  him  that  it  is  raining.  The  pattern  is:  <subjxvxNP>that<S>  (In  some  cases 
the  that  is  optional:  in  others  it  is  required.  See  the  discussion  of  the  feature 
THATREQCIRED  later  in  this  section.) 

INDOBJFOR  The  verb  can  take  an  indirect  object  which  allows  dative  movement  with  the  word  for.  (Again 
the  indirect  object  may  or  may  not  be  required.)  For  example:  allow,  find,  make.  Note:  verbs 
tiled  have  'be  feature  INDOBJFOR  cannot  take  INDOBJ&THATCOMP,  too. 

BITRANSnTV'E  The  verb  can  take  two  noun  phrases  (one  of  which  would  seem  to  be  an  indirect  object!:  the 
pattern  is  <subjxVxNPlxNP2>  but  not  <subjxV>sNP2>f  io,for}<NPl>.  For  example. 
they  showed  him  a  good  time.  *they  showed  a  good  time  to  him.  he  forgave  us  our  sins,  and  we 
en\y  you  your  wealth. 

PARTICLES  This  is  an  indication  that  the  verb  to  which  this  property  is  attached  may  take  one  or  more 
particles.  Each  verb-panicle  pair  is  stored  as  a  separate  entry  in  the  dictionary,  since  such  a  pair 
usually  has  a  meaning  quite  different  from  the  verb  alone.  Panicles  appear  either  before  or  after 
the  direct  object  (as  in  he  cleaned  up  the  mess,  he  cleaned  the  mess  up).  A  verb  may  take 
several  particles,  as  in  give  up,  give  in.  give  out. 

ITSUBJ-THATCOMP 

This  feature  indicates  a  nusing-io-subject  verb,  which  either  takes  the  word  it  in  subject  position 
together  with  a  following  that  complement,  or  takes  an  infinitive  (to  +  verb)  whose  subject 
appears  as  the  subject  of  the  original  verb.  It  seems  that  John  is  here.  John  seems  to  be  here. 

IDQOBJECT  The  verb  can  take  an  indirect  question  word  in  the  object  position  without  making  the  whole 
sentence  a  question  For  example:  /  know  who  did  it.  They  pondered  over  whether  to  do  it  and  / 
asked  the  man  how  to  get  to  flanard  Square. 

OBLIGATOR  YADV 

The  verb  require  an  adverb,  e  g.  he  put  it  there,  they  meant  well. 

The  three  features  PERCEIV  ECOMP,  PARTICIPLECOMP,  and  B ARE-INF-COMP  are 
mutually  exclusive  In  addition,  a  verb  may  have  only  one  of  the  features  MAKECOMP. 
PARTICIPLECOMP,  and  PERCEIV  ECOMP 
PERCEIV ECOMP 

Verbs  with  this  feature  can  appear  in  the  active  form  <VxNPx-tng  or  infinitive  complement 
and  in  the  passive  form  <bexV>to<infinitive  complement.  For  example:  /  heard  John 
readdng  the  story)  and  John  was  heard  to  complain.  Only  a  few  verbs  of  perception,  such  as 
hear,  nonce,  sec.  and  fee!  base  this  feature. 

PARTICIP*  ECOMP 

Verbs  with  this  feature  must  be  transitive  ami  may  take  a  -ing  or  -ed  complement.  If  no  NP 
occurs  after  the  verb,  the  complement  must  he  -ing.  For  example  /  deny  sending  it.  he  saw  her 
reading  it.  »  e  had  her  treated  at  the  hospital. 


TRANS 

PASSIVE 

INDOSJ 
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BARE-INF-COMP 

The  only  verbs  with  this  feature  are  have,  let ,  and  make.  The  complement  must  appear  with  a 
preceding  NP  and  a  bare  infinitive  in  the  active  form,  and  with  to  and  the  infinitive  in  passive 
form.  For  example:  Make  him  eat.  He  was  made  to  eat. 

MAKECOMP  Verbs  with  this  feature  act  rather  like  transitive  copulas.  There  are  only  a  few1  in  English,  e  g. 

declare,  make,  consider.  Some  examples  are:  Zhe  made  him  a  good  husband,  l  consider  him  a 
fool. 

NPPOCOM?  The  verb  icv^^es  an  object  NP  (which  appears  before  uie  io  complement  in  active  sentences,). 

For  example,  /  believe  him  to  have  eaten  it.  He  order  them  to  stop.  John  was  believed  to  hav e 
died. 

SUBJLOW  Verbs  which  are  marked  NPTOCOMP  or  TOCOMP  may  also  have  this  feature,  which  means 
that  the  subject  of  the  matrix  sentence  is  always  lowered  into  the  complement  to  become  the 
subject  there.  For  example:  Mary  promised  John  to  leave  early. 

THATCOMP  The  verb  takes  a  that  complement,  such  as  say  (that)  it  is  true,  note  that  he  faded.  Such  verbs 
are  usually  transitive,  but  not  ".'ways  (e.g.  agree). 

THATREQUIRED 

Some  thatcomp  verbs  are  also  marked  with  this  feature,  which  means  that  the  that  cannot  be 
deleted  from  the  complement  (e  g.  *he  commanded  they  stop).  Some  verbs  with  this  feature 
will  be  borderline  (agree i:  perhaps  the  grammar  should  be  willing  to  relax  this  constraint.  If  this 
feature  is  absent  but  THATCOMP  is  present,  assume  that  the  that  Is  optional. 

SUBJUNCTIVEREQLIRED 

Some  THATCOMP  verbs  are  also  marked  with  this  feature,  which  rr“T.s  that  the  v»rh  in  the 
embedded  S  is  subjunctive.  For  example,  /  demand  that  he  be  punished.  *1  demand  that  he  is 
punished.  ?/  demand  my  lawyer  be  present.  Verbs  of  this  type  seem  to  prefer  their  complements 
ic  entair.  a  that 

COPULA  The  verb  acts  like  me  copula  be  in  predicate  adjective  constructions,  like  he  (feels,  appears, 

becomes,  seems}  sad. 

FORTOCOMP  These  verbs  (and  adjectives)  allow  for  to  complements  in  which  'he  for  and  its  NP  are  alwavs 
optional.  The  paradigm  is  <V>(for  NP)to<V>.  For  example:  /  wish  ( for  Join.)  to  rti*.  i prefer 
(for  him)  to  go.. 

Compare  this  feature  with  INTRANSTOCOMP  and  TOCOMP.  Some  verbs  with  the  feature 
TOCOMP  can  look  like  FORTOCOMP  verbs  when  they  contain  a  long  adverb  in  the  middle 
(/ asked  very  loudly  for  him  to  leave),  but  they  should  be  marked  only  TOCOMP. 

A  verb  may  not  be  marked  both  FORTOCOMP  and  INTRANSTOCOMP:  words  which 
appear  to  be  both  should  be  marked  FORTOCOMP. 

Verbs  which  are  FORTOCOMP  take  this  feature  if  the  subject  is  not  to  be  lowered  to  become 
the  subject  of  the  lower  verb.  For  example.  He  said  to  go.) 

This  feature  may  appear  on  INTRANS  verbs  and  some  AD/s  which  allow  a  complement  in  the 
subject  position.  Often  this  subject  is  replaced  by  it  and  the  complement  appears  later.  For 
example:  for  him  to  pass  the  test  is  unlikely  ,  it  is  unlikely  that  he  passed  the  test,  that  he  will 
pass  it  is  unlikely .  to  pass  the  test  is  hard,  it  is  hard  to  pass  the  test. 

This  feature  applies  to  verbs  which  optionally  allow  an  NP  before  a  to  complement.  For 
example:  Ask  (them)  to  leave.  I'd  like  (himi  to  sir  dun  n.  He  chose  (hen  to  go.  (If  an  NP  is 
required,  the  verb  should  be  labeled  NPTOCOMP;  if  it  is  never  allowed,  it  should  be  labeled 
INTRANSTOCOMP.)  Note  that  in  some  cases,  a  long  adverb  before  the  first  NP  results  in 
what  looks  like  a  FORTOCOMP  /  asked  very  loudly  for  him  to  leave:  NPTOCOMPs  never 
do  this  Mark  these  verbs  as  TOCOMPs.  not  FORTOCOMPs  Verbs  with  TOCOMP  may 
also  be  marked  SUBJLOW. 

It  is  very  important  to  distinguish  all  forms  of  to  complements  from  the  construction  which  subsututes  to  for 
in  order  to.  Most  sentences  allow  an  in  order  to  construction  an  adjunct  (modifier  i.  as  in  he  ran  to  catch  the  train. 


NOLOW 

SUBJCOMP 

TOCOMP 
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and  the  grammar  handles  these.  A  verb  should  be  assigned  a  to  complement  feature  only  it  the  examples  do  not 
allow  the  in  order  to  substitution. 


5.2.5  Other  Categories 

BINDER  These  are  subordinating  conjunctions,  such  as  although,  if.  while  and  because. 

COMP  These  are  words  used  in  comparative  constructions,  such  as  at  least  ( AT-LEAST ),  less,  nearly. 

CONJ  These  words  are  conjunctions  for  simple  consmuents  such  as  .VP's.  For  example,  and.  as  well 

as  (AS~WELL~A S),  but,  unless,  or,  yet. 

DET  Determiners  are  words  that  can  occur  before  adjectives  in  an  VP.  Words  that  other  systems 

might  classify  as  quantifiers  or  articles  are  D£Ts  in  the  1RU3-I1  system  (e  g.  a,  all,  both,  no. 
that,  the.  these,  what,  which). 

INTEGER  These  are  number  words,  such  as  sesen.  eighteen,  forty .  (VB:  the  class  of  integers  is  not  closed, 

but  the  class  of  words  used  to  express  the  integers  is.  at  least  for  our  purposes,  i 

INTENSIFIERADV 

means  that  the  word  can  modify-  an  adverb,  as  in  most  quickly,  very  quickly,  too  quickly  and 
really  quickly. 

MONTH  These  are  the  months.  January  through  December.  (Abbreviations  such  as  Jan  and  Feb  are 

handled  with  a  SU  BSTITUTE  property.) 

UNIT  These  words  are  nouns  which  are  also  units  of  measurement,  such  as  foot.  sear,  degree,  gallon. 

tablespoon  and  meter.  The  value  of  this  property  gives  the  root  form  of  the  unit  together  with 
the  feature  UNIT  and  an  optional  value  indicating  the  dimension  of  that  unit.  Thus  the  UNIT 
property  of  the  word  FEET  is  (FOOT  (UNIT  LENGTH))  while  that  of  DOLLAR  is 
(DOLLAR  (UNIT)).  The  dimension  is  needed  if  the  unit  participates  in  constructions  such  as 
ten  feet  long,  a  meter  wide  and  three  minutes  long.  Currentlv  recognized  values  for  this  property 
are  LENGTH  TIME.  VOLUME.  TEMPERATURE 

Words  like  lone.  wide.  fill,  and farenheit  have  the  features  TIME.  LENGTH.  VOLUME,  and 
TEMPERATURE  respectively  tc  indicate  that  they  can  be  used  in  phrases  such  as  3  feet  long. 
2  hours  long.  2  teaspoons  full.  9  feet  deep.  17  degrees  Farenheit. 

NEC  The  only  entries  in  this  class  are  not  and  n't. 

NPR  Proper  nouns  are  nouns  such  as  Apollo.  China.  Washington.  Proper  nouns  are  (usually)  not 

modified  by  adjectives  and  (usually)  do  not  take  determiners. 

DEI  REQUIRED  This  is  for  proper  nouns  that  do  take  a  definite  determiner,  such  as  the  Hague,  the  White  House. 

the  Netherlands.  The  grammar  will  allow  proper  nouns  with  this  feature  to  be  used  without  a 
determiner  if  the>  function  as  modifiers,  for  example.  White  House  spokespeople  agreed  ... 
(There  are  a  few  SPRs  such  as  Earth  that  can  appear  with  or  without  a  determiner,  bm  the  case 
without  can  probably  be  treated  as  a  plain  noun  rather  than  as  an  NPR. ) 

ORD  These  are  the  ordinal  numbers,  e  g.  first,  second.  ... 

POSS  The  only  words  in  this  category  are  "  and  'S. 

POSSPRO  These  are  the  possessive  pronouns  my.  your.  his.  her.  our,  their,  us.  etc. 

PREP  Prepositions  introduce  prepositional  phrases  which  can  modify  verbs,  nouns,  or  adjectives 

•  Some  FREPs  are  about,  ahead  of  [AHEAD-OF),  up.  for).  Prepositions  can  take  three  features: 

BAREPREP  This  feature  means  that  the  preposition  can  occur  alone  without  an  object  NT.  for  example. 

below .  down,  hen .  inside,  there  and  tip.  (Some  dictionaries  consider  these  words  to  K-  adverbs, 
hui  we  disagree. i  Some  barepreps  can  optional!)  take  a  noun  phrase  as  their  object,  for 
example,  walk  out.  walk  out  the  door  j 
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NONPOBJECT  This  features  indicates  items  that  are  treated  as  PREP s  by  the  parser  which  cannot  take  an  S'P 
object.  Such  items  include  here,  there,  etc. 

PPOBJECT  These  prepositions  can  take  whole  prepositional  phrases  as  their  objects,  as  in  here  from  the  cm. 
across  from  John' s  house,  and  near  to  the  hike. 

PRO  Pronouns  are  all  the  non-possessive  and  non-question  pronouns,  such  as  anyone,  he.  her.  I.  me. 

myself,  that  and  it.  Pronouns  may  have  the  feature  NOPOSTMODIFIERS 

I'.OPOSTMODITfERS 

The  pronoun  cannot  participate  in  constructions  such  as  she  who  l  heard  from  and  he  who 
laughs  last. 

These  are  punctuation  marks,  such  as  ..  ., and  others. 

These  words  are  question  determiners:  how  many  (HOW-MANY),  how  much  (HOW-MUCH \, 
what,  which  and  whose. 

These  are  the  question  pronouns  what,  w  ho,  whom. 

These  are  the  quantifier  adjectives  how  many  (HOW-MANY),  many,  and  several. 

These  are  the  question  words  how ,  what  time  ( WHAT-TIME ).  w  hen,  w  here,  and  w  h\ . 

These  conjunctions  can  conjoin  entire  sentences,  such  as  and.  but.  nor.  or,  though,  until,  and  \et 
A  word  may  be  both  a  CONJ  and  a  SENTCONJ. 

SHORTANSWERThese  words  can  be  used  alone  as  answers  to  questions  or  as  unreductory  words  to  a  sentence. 

For  example:  ne\er  mind  (S'F\'ER~MIND ),  no,  yes,  oh  and  ok.  For  our  purposes,  this  is  closed 
class. 

SPECIAL  These  are  a  few  words  that  simply  don't  belong  in  any  of  the  other  categories,  such  as  than  and 

etc. 

TITLE  For  example:  Mr,  Ms.  Dr.  Chief  Justice  (CHIEF-JUSTICE),  sir. 


PUNCT 

QDET 

QPRO 

QUANTADJ 

QWORD 

SENTCONJ 


5.3  Edit  Word 


The  user  may  choose  to  edit  an  existing  word  This  max  either  be  a  word  that  was  added  dunng  the  current 
session,  or  a  word  that  is  already  in  the  dictionary.  When  the  user  chooses  Edit  Word  from  the  dictionary  menu,  he 
is  prompted  for  the  name  of  the  word  he  wishes  to  edit.  The  words  added  dunng  the  current  session  arc  checked 
first.  The  files  that  were  opened  dunng  system  initialization  are  then  tned  one  at  a  time  until  the  won!  is  found. 
Thus,  a  more  recently  opened  file  takes  precedence  over  a  previously  opened  file.  The  purpose  of  this  mechanism  is 
to  allow  patch  files  that  override  an  existing  file  If  the  word  does  not  have  a  dictionary  entry,  the  system  displays  a 
message  to  that  effect  If  there  is  a  dictionary  entry  for  the  word,  the  current  definition  of  the  w  ord  is  display  ed  on 
the  screen.  The  user  is  then  presented  w  ith  a  number  of  choices  that  can  be  selected  from  a  menu. 

Want  to  Check,  Edit,  Replace,  Delete,  Add,  Help,  File  or  Ok? 

The  options  here  are: 

•  Check:  checks  the  consistency  of  the  current  syntactic  features  of  word.  If  they  are  consistent,  the 
message 

word  is  alright. 

will  he  displayed  If  there  is  some  inconsistency,  the  following  message  will  be  printed  out 
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Problems  with  word: 

followed  by  a  specification  of  the  inconsistency.  The  editor  checks  for  the  following  forms  of 
inconsistency: 

Presence  of  a  feature  that  is  only  associated  with  a  particular  category  : 

for  example,  only  an  ADJ  can  have  the  feature  PREDADJ  If  a  word  that  is  not  an  ,-WJ  has  this 
feature,  the  following  type  of  message  would  be  pnnted  out: 

((word  PREDADJ  requires  ADJ) I 
Presence  of  a  feature  that  requires  another  feature: 

for  example,  only  V  s  that  have  the  feature  TRANS  can  have  the  feature  PASSIVE.  If  a  word  has  the 
feature  PASSIVE  without  the  feature  TRANS,  the  following  type  of  message  would  be  printed  out: 

((word  PASSIVE  requires  TRANS)) 

Occasionally  ,  a  feature  is  contingent  on  one  of  several  features.  For  example.  TH.4TREQUIRED  can 
appear  on  a  word  that  has  one  of  the  features  THATCOMP  or  INDOBJ&THATCOMP.  If  a  word 
that  does  not  have  one  of  these  features  has  the  feature  THATREQL1RED.  the  following  type  of 
message  would  be  printed  out: 

((word  TH  ATREQUIRED  requires-one-of  (THATCOMP  INDOBJ&THATCOMP))) 

•  Edit:  Use  the  Edit  command  only  if  vou  are  sure  you  understand  the  internal  representation  of  the  kind 
of  property  you  are  changing.  Use  Delete  and  Add  otherwise. 

•  Replace:  prompts  for  the  replacement  value  for  the  specified  property. 

•  Delete:  is  used  to  remove  one  of  the  existing  properties  of  word.  Removes  the  SPECIFIED  property 
from  the  dictionary  entry 

•  Add  is  used  to  add  a  property .  You  will  be  prompted  for  the  value  of  the  property  just  as  if  you  were 
defining  that  part  of  word  for  the  first  time.  Adds  the  specified  property  to  the  dictionary  entry. 

•  Help  prints  a  help  message  for  the  specifed  property  . 

•  File:  exit  and  save  the  dictionary  entry  in  dictionary.  File  and  Ok  both  indicate  that  you  are  satisfied 
with  the  definition  of  the  word  and  wish  to  exit  from  the  word  editor,  but  File  indicates  that  you  want 
word  to  be  filed  immediately. 

•  Ok:  exit  and  leave  the  dictionary  entry.  Indicates  that  you  are  satisifed  with  the  definition  of  word,  but 
that  you  do  not  want  to  file  it  now. 


Each  time  you  make  a  modifkahon  to  word,  the  definition  will  be  redisplayed  along  with  the  question.  Want 
to  Check,  Edit,  Replace,  Delete,  Add.  Help.  File  or  Ok? 


5.4  The  ('(impounds.  Multiples,  and  Substitute  Properties 


The  follow  ing  properties  are  not  restneted  to  appearing  on  particular  parts  of  speech  and  can  appear  on  any 
dictionary  entry 
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5.4.1  Compounds 


The  COMPOLrVDS  proj>erty  is  attained  to  words  which  begin  compound  phrases.  The  compound  phrase 
should  be  an  entry  in  the  dictionary . 

(UNIT 

COMPOUNDS  ( (REPORT  (UNIT-REPORT) ) 

(IDENTIFICATION  NIL  (CODE  (UNIT- IDENTIFICATION-CODE) ) ) ) 

N  -S 

SEMANTICS  (UNIT  *) ) ) ) 


5.4.2  Multiples 


The  MULTIPLES  property  is  attached  to  words  whose  meaning  can  be  expressed  w  ith  an  alternate  phrase 

(PACIFIC  MULTIPLES  ( (FLEET  (PACIFIC-FLEET) ) 

(OCEAN  (PACIFIC-OCEAN)))) 


5.4.3  Substitute 


The  SLrBSTITT'TE  property  indicates  that  the  word  in  question  should  be  replaced  by  the  given  substitute 
word.  This  is  used  for  abbieviahons  ie  g.  Jan  for  January)  and  contractions  (e  g  can't). 

(JAN  SUBSTITUTE  ( (JANUARY) ) ) 


5.5  Print  Word 


The  selection  of  Print  Wird  on  the  dictionary  menu  enables  the  user  to  display  the  definition  of  a  word  on 
the  screen.  The  user  is  prompted  for  the  word  to  be  displayed  If  the  word  is  undefined,  the  system  displays  a 
message  to  that  effect 
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5.6  Dictionary  Commands 


The  two  dictionary  commands  on  the  dictionary  menu  are  Save  Dictionary  and  Make  Pretty  Dictionary 

5.6.1  Save  Dictionary 

The  Save  Dictionary  command  saves  the  words  into  dictionary  files. 


5.6.2  Make  Pretty  Dictionary 

The  Make  Pretty  Dictionary  command  enables  the  user  to  get  a  version  of  the  file  that  people  can  read. 
Keep  in  mind  that  since  each  word  has  to  be  read  from  the  dictionary  file  and  written  to  the  pretty  file,  making  a 
pretty  file  can  be  tune  consuming. 


5.7  Semantics 


Chapter  4  referred  to  the  fact  that  dictionary  entries  link  words  to  the  corresponding  domain  model  concepts. 
In  some  of  the  examples  in  the  current  chapter,  SEMANTICS  has  appeared  as  one  of  the  features  of  the  example 
words.  In  this  section  we  briefly  discuss  the  process  of  assigning  SEMANTICS  to  dictionary  entries.  The 
motivation  for  assigning  specific  SEMANTICS  features  is  to  be  found  in  the  domain  model  and  the  IRULES 
developed  for  the  domain 

To  add  SEMANTICS  to  a  word,  the  user  selects  SEMANTICS  from  the  menu  Usung  categories  and  features. 
This  menu  is  displayed  during  the  process  of  adding  or  editing  a  dictionary  entry  The  user  then  selects  ADD  from 
the  next  menu  The  user  is  prompted  for  the  SEMANTICS  to  be  added.  The  SEMANTICS  property  is  a  list  of  pairs 
associating  semantic  categones  (concepts  or  roles  in  the  domain  model,  or  names  of  irules)  and  restnetioas  on  those 
categories.  First  the  category  is  entered:  then  the  restnetion.  Usually  there  is  no  restriction,  so  a  value  of  *  is 
entered.  The  following  examples  illustrate  unrestricted  semantic  properties  of  some  words. 

(SUBMARINE  N  -S 

SEMANTICS  (SUBMARINE  *) 

(CARRIER  N  -S 

SEMANTICS  (CARRIER .VESSEL  *) 

(DEPLOY  FEATURES  (BITRANSITIVE  INTRANS  PASSIVE  TRANS) 

SEMANTICS  (DEPLOYMENT  *) 

V  S-ED) 
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A  review  of  these  examples  indicates  that  there  are  few  surprises  in  the  assignment  of  SEMANTICS  to  a 
word.  The  word  SUBMARINE  is  represented  in  the  IRULEs  as  the  domain  model  concept  SUBMARINE.  The 
word  carrier  has  the  SEMANTICS  (CARRIER.YESSEL  *)  associated  with  it.  The  semantic  interpretation  of 
carrier  is  thus  dependent  on  the  treatment  of  CARRIER. VESSEL  in  the  domain  model  and  the  IRULEs.  Likewise, 
the  verb  deploy  is  semantically  characterized  as  (DEPLOYMENT  *). 

The  following  are  examples  of  words  whose  SEMANTICS  properties  have  restrictions  on  the  semantic 
categories: 

(FREDERICK  FEATURES  (DETREQUIRED ) 

NPR  * 

SEMANTICS  (CONVENTIONAL .SURFACE . SHIP  (NAMEOF  "Frederick"))) 

(HONOLULU  NPR  * 

SEMANTICS  (MULTIPLE -SEMANTICS 

((PORT  (NAMEOF  "Honolulu") 

NUCLEAR. SUB  (NAMEOF  "Honolulu"))))) 

Frederick  happens  to  be  the  name  of  a  ship.  The  SEMANTICS  property  specifies  this  with  the  expression 
(CONVENTIONAL.SIRFACE.SHIP  (NAMEOF  "Frederick")).  Honolulu  has  two  semantic  categories 
associated  with  it,  because  it  is  both  the  name  of  a  port  and  the  name  of  a  nuclear  submarine.  Each  semantic 
category  has  a  (NAMEOF  "Honolulu")  restriction. 


Report  No.  7 144 


BBN  Systems  and  Technologies  Corporation 


6.  World  Model  Language  Representations  of  Queries 

6.1  Intensional  Logic 

One  of  the  difficulties  in  developing  a  system  of  logical  representation  for  natural  language  is  that  traditional 
logic  is  much  more  limited  in  its  expressive  range  than  natural  language.  Intensional  logic  represents  one  direction 
in  broadening  the  expressive  range  of  logic  so  that  it  covers  a  broader  scope  of  natural  language. 

Intensional  logic,  as  implemented  at  BBN.  extends  the  capabilities  of  extensional  logics  by  allowing  explicit 
reference  to  times  and  possible  worlds.  For  example,  it  can  represent  propositions  whose  truth  values  vary 
depending  on  time  (a  characteristic  typical  of  real-worid  applications);  in  addition,  it  provides  for  propositions 
which  are  either  true  in  some  possible  world,  or  necessarily  true  in  all  possible  worlus. 

The  INTENSION  operator  makes  this  possible.  The  INTENSION  of  an  expression  represents  a  function 
taking  two  arguments:  a  time  specification,  and  a  world  specification.  Given  these  specifications  (which  can  be 
thought  of  as  contextual  indices),  the  function  returns  the  denotation  of  the  expression  at  the  time  and  world 
specified  So.  for  example,  the  intensional  expression  INTENSION alive(shakespeare))  represents  a  function  that 
takes  a  time  argument  and  a  w’orld  argument,  and  returns  a  truth  value.  With  arguments  July  1.  1988  and 
current-world,  the  value  returned  is  FALSE:  with  arguments  July  1 .  1600  and  current-world,  the  value  returned  is 
TRUE.  Terms,  as  well  as  propositions,  may  have  intensions:  for  example.  INTENSIONf'the  person  reading  this 
sentence  )  represents  a  function  which,  given  the  current  instant  and  the  current  world,  denotes  you.  the  reader. 

In  many  cases,  querying  an  underlying  system  can  be  thought  of  as  extensionalizing  a  description  given  by  the 
user.  Part  of  the  WML  representation  of  "Which  earners  are  Cl"  represents  the  inteasion  of  the  set  of  Cl  earners, 
and  it  is  the  system’s  task  to  extensionalize  this,  returning  the  set  denoted  by  the  expression,  in  this  case  in  the 
current  time  and  world. 

In  general,  intensional  logic's  specification  of  lime  and  world  permits  the  representation  of  complex  natural 
language  utterances.  This  new  logic  is  powerful  enough  to  express  generics,  collective  quantification,  modalities, 
tense,  propositional  attitudes,  mass  temis.  and  discourse  context. 

In  practice,  of  course,  many  quenes  do  not  require  the  use  of  explicit  contextual  indices.  In  such  cases,  the 
time  and  world  arguments  default  to  TIME  and  WORLD,  which  represent  the  current  time  (time  of  the  utterance)  and 
current  world 
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6.2  Domain  Model  Constants  and  Predicates  in  WML 

In  addition  to  branch-categories  such  as  SETOF  and  INTENSION,  and  logical  constants  such  as  TRUE  and 
NULL-SET,  WML  comprises  elements  from  the  domain  model.  The  roles  in  the  domain  model  correspond  to 
binary  predicates  in  WML;  for  example,  the  domain-model  role  VESSEL. NAME  corresponds  directly  to  the  binary 
WML  predicate  vessel. name(. r,  yj,  which  is  a  relation  taking  two  arguments  of  the  same  types  as  the  domain  and 
range  of  VESSELJJAME  (namely,  vessels  and  strings).  The  concepts  in  the  domain  model  correspond  to  unary 
WML  predicates,  or,  equivalently,  to  the  positive  extensions  of  those  predicates.  For  example,  the  domain-model 
concept  VESSEL  corresponds  to  the  WML  predicate  vessel(x),  or  to  the  corresponding  set  {x/vessel(x)j. 

When  one  expands  the  domain  model,  therefore,  one  is  also  extending  the  World  MotH  Language,  by  adding 
new  unary  and  binary  predicates.  Section  4.2  discusses  how  IRules  are  used  to  compose  WML. 

6.3  Example 

Here  we  present  an  example  of  a  query's  representation  in  WML.  pointing  out  its  relationship  to  the  domain 
model,  and  explaining  some  aspects  of  its  construction. 

Query:  "Which  ships  are  in  the  Indian  Ocean?" 

WML  expression  generated  by  IRUS-II: 

(QUERY 

( (INTENSION 
(PRESENT 

(INTENSION 

(IOTA  ? JX9  (POWER  VESSEL) 

(IN. PLACE  ? JX9  INDIAN. OCEAN) ))) ) 

TIME  WORLD) ) 

VESSEL  and  INDIAN  .OCEAN  correspond  to  concepts  in  the  domain  model:  IND/AN.OCEAN  is  an  individual 
concept,  and  for  this  reason  is  treated  as  a  constant  in  the  WML  expression.  The  binary  predicate  IN.PL\CE 
corresponds  to  a  role  in  the  domain  model  -  a  role  whose  domain  includes  vessels,  and  whose  range  includes  the 
Indian  Ocean  (were  this  not  the  case,  semantic  constraints  stated  in  the  Irules  w'ould  have  been  violated,  and  this 
WML  expression  never  produced  i. 

The  central  subexpression  in  this  query  is 

(IOTA  ?JX9  (POWER  VESSEL)  (IN. PLACE  ?JX9  INDIAN. OCEAN >). 

IOTA  introduces  a  scoping  expression,  and  as  such  introduces  a  sorted  van  able.  ?JX9  in  this  case,  which  in 
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constrained  to  denote  an  element  of  the  set  of  all  possible  sets  of  VESSELS,  indicated  by  (POWER  VESSEL)  in  the 
son  field  of  the  expression.  The  body  of  the  IOTA  expression  further  constrains  the  possible  values  of  ?JX9,  to 
require  that  the  IN. PLACE  relation  holds  between  it  (?JX9)  and  INDIAN. OCEAN.  An  expression  with  branch 
category  IOTA  denotes  an  individual,  or  set  of  individuals,  and  indicates  that  only  element  of  the  set  that  its  variable 
ranges  over  is  salient,  i.e.,  in  this  example,  only  one  maximal  SET  of  VESSELS  is  expected  to  be  a  possible 
instantiation  for  ?JX9.  (Note  that  this  is  not  Russel’s  iota ,  which  is  denotationless  if  mere  than  one  instantiation  is 
possible:  in  WML,  if  a  IOTA  variable  ranges  over  a  power  set.  as  in  this  example,  the  IOTA  denotes  the  maximal 
set  that  the  variable  can  range  over  (the  maximal  set  of  VESSELS  in  the  INDIAN  OCEAN).  If  the  variable  ranges 
over  a  simple  set,  only  one  possible  instantiation  is  expected  for  the  variable,  and  more  than  one  (or  none)  indicates 
a  presupposition  violation.)  • 

Wrapping  the  INTENSION  operator  around  this  subexpression,  we  have  an  expression  that  represents  a 
function  from  times  and  worlds  to  sets  of  vessels:  the  PRESENT  operator  partly  constrains  the  function,  requiring 
that  the  time-index  argument  be  within  the  present"  (however  that  is  defined)  in  order  to  have  a  value  returned 
Given  that  the  TIME  and  WORLD  arguments  immediately  follow,  what  we  really  have  is 

(QUERY'  <extension>). 

where  <e.\tenston>  denotes  the  set  of  ships  in  the  Indian  Ocean  in  the  current  time  and  world.  The  speech-act 
(QUERY)  indicates  that  the  extension  s  value(s)  should  be  returned  to  the  user. 

6.3.1  WML  Types 

The  formal  syntax  and  semantics  of  WML  are  in  [II],  In  this  and  the  following  section,  we  present  an 
overview  of  the  type  system  and  the  definition  of  meaningful  WML  expressions. 

Each  valid  WML  expression  has  a  branch  category  ,  usually  the  CAR  of  the  expression.  For  example,  the 
branch  category  of 

(EXISTS  ?JX1  UNIT  (UNITS. OVERALL. READINESS  ?JX1  Cl)) 
is  EXISTS.  Branch  categories  are  the  "glue"  that  make  complex  WML  expressions  out  of  simple  expressions.  Each 
WML  expression  has  a  corresponding  TYPE.  The  extension  of  a  WML  expression  must  be  a  member  of  the 
WML's  type.  For  example,  the  type  of  the  expression  with  branch  category  EXISTS  above  is  TV  (for  truth  value), 
and  it  must  denote  either  TRUE  or  FALSE.  The  ty  pe  of 

(IOTA  ?  JX2  VESSEL  (NAMEOE  ?JX2  "Frederick")) 
is  VESSELS,  and  it  denotes  a  member  of  the  set  of  VESSELs. 

A  WML  expression  is  meaningful  it  it  has  a  TY'PE  that  is  not  NULL-SET.  (NULL-SET  a  special  type:  the 
type  of  meaningless  WML  expressions  )  Determining  the  type  of  an  expression  is  a  recursive  process,  it  is  a 
function  of  the  types  of  its  pans  Each  branch  category  has  a  corresponding  type  constructor  that  determines  how 
the  type  of  an  expression  with  that  branch  category  is  obtained  from  the  types  of  its  pans.  By  providing  a  principled 
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way  to  obtain  the  type  of  complex  WML  expressions,  we  are  given  a  mechanism  to  test  the  validity  of  an  expression 
and  the  range  of  its  possible  denotations. 

The  set  of  possible  types  TYPE  is  defined  (recursively)  as: 

•  TV.  INT,  T,  and  W  (for  truth  value,  integer,  time,  and  world,  respectively),  are  in  TYPE. 

•  Whenever  a  domain  model  concept  C  denotes  a  set  of  individuals.  CT  is  in  TYPE. 

•  Whenever  ta,  tb  are  in  TYPE,  then  (ta  tb)  is  in  TYPE. 

(ta  tb)  is  the  type  of  a  function  whose  domain  is  ta  and  range  is  tb,  i.e.,  one  can  construct  arbitrarily 
complex  functions. 

•  Whenevet-  tal,ta2,...,tan  are  in  TYPE,  then  (TUPLE-TYPE  tal,ta2 . tan)  is  in  TYPE. 

•  Whenever  tal,ta2,...,tan  are  in  TYPE,  then  (UNTON-TYPE  tal.ta2 . tan)  is  in  TYPE. 

•  Whenever  ta  is  in  TYPE,  then  (BAG-TYPE  ta).  (SET-TYPE  ta).  and  (ORDF.RED-BAG-TYPE  ta)  are 
in  TYPE. 

•  NULL-SET  is  in  TYPE,  it  is  the  type  of  denotationless  expressions. 


6.3.2  Meaningful  WML  Expressions 

This  section  formally  defines  the  set  of  meaningful  WML  expressions,  by  defining  all  valid  WML  branch 
categones.  and  giving  type  constructors  for  them.  Any  expression  not  satisfying  the  following  definition  is 
denotationless  (has  type  NULL-SET). 

Note: 

•  We  use  the  convention  that  parentheses  and  atoms  in  CAPITAL  LETTERS  should  be  taken  verbatim, 
as  quoted  expressions,  but  atoms  in  small  letters  denote  meta-vanables  which  stand  for  arbitrary 
expressions. 

•  TYPE-OF  is  a  lisp  function  which,  when  given  an  input  x.  "returns"  the  type  of  x  if  x  is  well-formed, 
and  NULL-SET  otherwise.  We  use  the  notation  :=  for  returns”.  (TYPE-OF  x)  :=  tx  means,  "when 
(.APPLY  #‘TYPE-OF  xi  is  executed,  it  returns  tx". 

•  Speech  acts  are  in  our  language,  and  are  used  in  the  interpretation  of  a  sentence.  They  are  special 
branch  categones.  The  WML  corresponding  to  an  utterance  will  always  have  the  form  (speech-act 
((INTENSION  x)  time  world).  The  set  of  speech  acts  includes  QUERY,  ASSERT,  and  BRING- 
ABOLT. 

1.  (TYPE-OF  vanablet  :=  tv,  where  tv  is  the  sort  in  the  expression  that  defines  the  scoping  environment 
for  the  variable. 

2.  (TYPE-OF  *T*)  :=T,  (TYPE-OF  *W*i  =  W.  *T*  and  *W*  are  special  variables  in  WML 

3.  (TYPE-OF  domam-model-role-coastanu  :=  ((TUPLE-TYPE  id  tr)  TV)  where  td  and  tr  are  the  domain 
and  range  ty  pes  of  the  role  in  the  domain  model 

4.  (TYPE-OF  (LAMBDA  (ui  p  b ) )  :=  (tu  tbl.  if  u  is  a  variable  of  ttpe  (u.  (TYPE-OF  p)  :=  ( tu  TV),  and 
(TYPE-OF  b)  :=tb. 

5.  (TYPE-OF  (fail  :=  tr.  if(TY'PE-OF  ft  :=  (td  tr  i  and  (TYPE-OF  a)  :=  ta.  and  the  intersection  of  td  and 
ta  is  non-empty . 

6.  (TYPE-OF  ( EQUAL  a  b»)  :=  TV.  if  (TYPE-OF  a  i  =  ta.  (TYPE-OF  b)  :=  tb,  anti  the  intersection  of  ta 
and  tb  is  non-empty 
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7.  (TYPE-OF  (PRED-TO-SET  p»  :=  (SET-TYPE  ta).  if  (TYPE-OF  p)  :=  (ta  TV). 

8.  (TYPE-OF  (CARD  s))  :=  INT,  if  (TYPE-OF  s  I  :=  (SET-TYPE  ta). 

9.  (TYPE-OF  (SET-OF  al  a2  ...  an))  :=  (SET-TYPE  (UNION-TYPE  tal  ta2  ...  tan)),  if  (TYPE-OF  al )  := 

tal,  (TYPE-OF  a2)  :=  ta2 . and  (TYPE-OF  an)  :=  tan. 

10.  (TYPE-OF  (FORALL  u  p  b»  :=  TV.  and  (TYPE-OF  (EXISTS  u  p  b))  :=  TV,  if  u  is  a  variable  of  type 
tu.  (TYPE-OF  p)  :=  (tu  TV),  and  (TYPE-OF  b)  :=  TV. 

1 1.  (TYPE-OF  (IOTA  u  p  b))  :=  tu,  if  u  is  a  vanable  of  type  tu,  (TYPE-OF  p)  :=  (tu  TV),  and  (TYPE-OF 
b)  :=TV. 

12.  (TYPE-OF  (TUPLE-OF  n  al  a2  ...  an))  :=  (TUPLE  n  tal  ta2  ...  tan),  if  (TYPE-OF  al)  :=  tal. 

(TYPE-OF  a2)  :=  ta2 . (TYPE-OF  an)  :=  tan. 

13.  (TYPE-OF  (UNION  p)>  :=  (ta  TV),  if  (TYPE-OF  p)  :=  ((ta  TV )  TV). 

14.  (TYPE-OF  (POWER  p))  :=  ((taTV)  TV),  if  (TYPE-OF  p)  :=  (ta  TV ). 

15.  (TYPE-OF  (APPLY-COLLECT  f  si)  :=  (BAG-TYPE  tr),  if  (TYPE-OF  s)  :=  (SET-TYPE  ta)  or 
(TYPE-OF  s)  :=  (BAG-TYPE  ta).  (TYPE-OF  f)  :=  (td  tr).  and  the  intersection  of  td  and  ta  is  non¬ 
empty. 

16.  (TYPE-OF  (MEMBER  as))  :=  TV.  if  (TYPE-OF  s)  :=  (SET-TYPE  ta)  and  (TYPE-OF  a)  :=  ta.. 

17.  (TYPE-OF  (BAG-TO-SET  b)l  :=  (SET-TYPE  tb>.  if  (TYPE-OF  b)  :=  (BAG-TYPE  tb). 

18.  (TYPE-OF  (FIRST  n  o-bag))  :=  (ORDERED-BAG-TYPE  ta)  if  (TYPE-OF  n)  :=  INT  and  (TYPE-OF 
o-bag >  :=  (ORDERED-BAG-TYPE  ta) 

19.  (TYPE-OF  (SORT  set  fun))  :=  (ORDERED-BAG  ta)  if  (TYPE-OF  set)  :=  (SET-TYPE  ta).  (TYPE-OF 
fun)  :=  (td  tr).  and  the  intersection  of  td  and  ta  is  non-empty. 

20.  (TYPE-OF  (ORDERED-BAG-TO-SET  o-bag))  :=  (SET-TYPE  ta)  if  (TYPE-OF  o-bag)  := 
(ORDERED-BAG-TYPE  ta). 

21.  (TYPE-OF  (SKOLEM  n  p))  :=  ta  if  (TYPE-OF  n)  :=  INT.  and  (TYPE-OF  p)  :=  (ta  TV) 

22.  (TYPE-OF  (AND  fl  f2)h  (TYPE-OF  (OR  fl  f2)).  (TYPE-OF  (IMPLIES  fl  12)),  (TYPE-OF  (IFF  fl 
f20)),  (TYPE-OF  (NOT  fl)).  (TYPE-OF  (NECESSARILY  fl)).  and  (TYPE-OF  (POSSIBLY  fl)i  := 
TV,  if  (TYPE-OF  fl )  :=  TV  and  (TYPE-OF  12 1  :=  TV. 

23.  (TYPE-OF  (INTENSION  a))  :=  (W  (T  ta)).  if  (TYPE-OF  a)  :=  ta. 

24.  (TYPE-OF  (EXTENSION  i)):=ta.  if  (TYPE-OF  l  ):=  (W  (T  ta)). 

25.  (TYPE-OF  (<tense-operator>  i  )) )  :=  ta.  where  <tense-operator>  is  one  of  PRESENT.  PAST, 
FUTURE.  kRESPERF,  and  PASTPERF,  if  (TYPE-OFi)  :=  (W  (T  ta)). 

26.  (TYPE-OF  (G  i ))  :=  (ta  TV),  if  (TYPE-OF  i)  :=  (W  (T  (ta  TV))),  and  ta  is  a  type  of  individuals  (a 
domain  concept  type). 

27.  {  (YPE-OF  (SAMPLE  D)  :=  (ta  TV),  it  (TYPE-OF  i)  :=  (W  <T  (ta  TV'))).,  and  ta  is  a  type  of 
individuals. 

28.  (TYPE-OF  (KIND-OF  li)  :=  ta.  if  (TYPE-OF  n  :=  (W  (T  (ta  TV))),  and  ta  is  a  type  of  individuals 

29.  Nothing  else  is  a  WML  expression 
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7.  Users'  Guide  to  KNACQ  in  IRUS-II 

7.1  Purpose  of  KNACQ 


This  chapter  describes  the  component  KNACQ  (KNowledge  ACQuisition)  [33]  of  BBN's  IRUS-II  natural 
language  system.  This  component  greatly  eases  the  task  of  defining  the  lexical  semantics  for  words  in  a  new 
domain.  KNACQ  assumes  a  predefined  domain  model  that  describes  in  the  NIKL  Knowledge  Representation 
language  the  logical  concepts  (unary  predicates)  and  roles  (binary  predicates)  that  are  relevant  to  the  domain 
KNACQ  annotates  that  domain  model  with  information  about  how  those  concepts  and  roles  can  be  referred  to  in 
natural  language.  It  acquires  that  information  for  each  concept  and  role  from  a  system  developer. 

KNACQ  records  various  kinds  of  data  about  the  natural  language  expressions  related  to  each  concept  in  the 
domain  model.  The  simplest  kind  of  knowledge  acquired  by  KNACQ  is  the  NL  terms  that  can  be  used  to  refer 
directly  to  instances  of  the  concept,  for  example,  that  instances  of  the  vessel  class  ”CG”  can  be  referred  to  as 
“cruisers". 

Much  of  KNACQs  power  comes  from  a  second  class  of  data  that  captures  and  exploits  the  common  elements 
found  in  the  many  different  ways  in  which  the  relational  nouns  that  describe  attributes  of  classes  can  be  used  in 
database  contexts  For  example,  KNACQ  acqufrc..  from  tlie  developer  the  information  that  the  “speed-of  role 
connecting  “vessels"  to  “speed-measures”  can  be  referred  to  by  the  relational  noun  “speed”.  Using  that  data,  the 
names  for  rite  con^pis.  and  uic  fact  that  “speed-measures”  are  defined  as  “one-dimensional-scalars”  in  the 
domain  model,  KNACQ  then  makes  it  possible  for  IRUS-II  to  correctly  interpret  a  great  variety  of  expressions 
referring  to  vessel  speeds,  including: 

•  the  speed  of  the  cruisers 

•  the  vessel's  speed 

•  the  Vinson  has  a  speed  of  20  knots 

•  vessels  with  a  speed  of  less  than  20  knots 

•  the  fastest  vessel 

KNACQ  thus  makes  available  a  great  deal  of  linguistic  power  based  solely  on  the  names  given  to  concepts  and 
roles. 

Other  portions  of  KNACQ  cover  phenoma  that  do  not  fit  the  attribute  pattern.  For  example,  a  more  genera! 
scheme  called  caseframe  rules  handles  lexical) /.ations  of  roles  that  use  different  prepositions  or  reference  patterns 
than  apply  to  attributes  in  general  For  example,  suppose  that  ^  'casualty  report"  can  be  said  to  he  “from”  a 
"military  unit",  where  the  prepositional  object  specifies  the  filler  of  the  ”unit-of"  role  for  the  “casualty  report" 
concept  Note  that  this  example  inverts  the  usual  pattern  from  prepositional  phrases  embody  ing  relational  nouns: 
"the  unit  of  report-36",  like  "the  speed  of  Frederick",  is  the  normal  relational  case,  where  the  object  of  the 
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prepositional  phrase  is  the  primary  concept,  the  domain  of  the  role  in  question,  but  in  "the  report  from  unit- 12  ",  the 
main  noun  gives  the  primary  concept,  while  the  object  of  the  preposition  specifies  the  relauonship  by  specifying  its 
filler.  Caseframe  rules  provide  for  such  more  general  lexicalization  patterns  for  roles. 

Special  KNACQ  code  also  handles  those  gradable  adjectives  whose  meaning  is  denved  from  the  value  of  a 
single  role,  like  "fast”,  in  the  sense  of  having  a  greater  speed  than  another  vessel  or  than  a  given  cutoff  value. 

The  KNACQ  data  acquired  from  the  system  developer  is  stored  as  annotations  attached  to  the  NIKL  domain 
model.  When  that  model  is  in  turn  loaded  into  [RUS-II,  those  annotations  are  convened  into  semantic  fields  in  the 
definitions  of  the  given  words  that  allow  IRUS-II  to  parse  sentences  containing  them.  The  KNACQ  acquisition 
process  is  highly  interactive,  and  is  implemented  as  an  extension  to  the  KREME  knowledge  base  editing  system, 
which  allows  the  developer  to  explore  the  NIKL  network  graphically  and  edit  it  incrementally.  Note,  however,  that 
the  connection  between  KNACQ  and  IRUS-II  is  currently  implemented  by  having  KNACQ  write  out  a  complete 
version  of  the  domain  model,  including  the  KNACQ  annotations,  and  then  having  ERUS-I1  read  and  process  that 
file.  Thus,  domain  model  changes  or  KNACQ  additions  will  not  take  effect  in  IRUS-II  until  that  file  writing, 'reading 
is  done.  There  are  plans  to  make  this  connection  more  fully  incremental  in  the  future. 

While  the  various  facilities  of  KNACQ  go  a  long  way  toward  speeding  up  the  acquisition  of  lexical  semantics 
for  new  domains,  they  do  not  do  the  whole  job.  KNACQ  is  attribute  treatment  handles  that  portion  of  noun  phrase 
semantics  that  depend  on  highly-predictable  patterns.  Though  caseframe  rules  allow  for  coverage  of  simple  non- 
auibctc  *TPs,  ihc  oemanti''  entires  for  more  complex  noun  phrase  elemep's  and  for  verbs  still  need  to  be  entered 
directly,  by  creating  IRULES  for  each  sense  and  attaching  those  IRULES  to  the  words  involved.  The  IRACQ 
component  of  IRUS-II  provides  an  example-based  scheme  for  acquinng  verb  semantics.  Note  that  KNACQ's  initial 
coverage  of  noun  phrase  semantics  contnbutes  in  turn  to  using  IRACQ  for  verb  phrase  semantics,  since  it  provides  a 
wide  range  of  noun  phrases  for  use  in  the  example  sentences  that  IRACQ  uses.  Together.  KNACQ  and  IRACQ 
allow  rapid  bootstrapping  acquisition  of  the  semantics  for  new  domains. 


7.2  Adding  KNACQ  Data  to  the  Domain  Model 


The  KNACQ  interface  for  adding  or  changing  the  data  associated  with  a  given  concept  or  role  is  implemented 
as  supplementary  functionality  added  to  the  KREME  knowledge  representation  editing  system  (The  files 
implementing  these  additions  are  termed  the  KIT  system,  for  KNACQ  User  Interlace.)  To  use  KNACQ,  use 
Select-K  to  access  KREME.  mouse  the  "parameters”  button  in  the  "main  commands”  menu  and  select  KNACQ  as 
the  “language  mode".  (KNACQ  mode  in  KREME  is  a  submode  of  NIKL  mode,  if  it  does  not  seem  to  be  listed  as 
an  option,  make  sure  that  the  MKL-COMP AT ABILITY  and  KUI  systems  are  loaded.)  Then  choose  the  main 
command  Toad  network",  entenng  the  names  of  the  file  or  files  containing  the  NIKL  domain  model.  (To  enter 
more  that  one  file  name,  type  a  single  comma  by  itself  alter  each  file  name  except  the  last.)  After  the  network  has 
been  loaded  and  classified,  the  KREME  command  "edit  concept",  given  a  concept  name,  will  bring  cp  three 
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windows:  a  graph,  showing  that  concept  with  its  ancestors  and  descendents,  a  window  describing  that  concept's 
definition,  and  a  window  listing  the  roles  for  w  hich  that  concept  is  either  the  domain  or  a  subconcept  of  the  domain. 
What  you  will  actually  see  in  response  to  "edit  concept"  is  a  small  box.  indicating  the  system's  request  for  a 
location  for  the  new  collection  of  three  windows.  Moving  the  mouse  will  move  the  box  around:  moving  the  mouse 
while  the  shift  key  is  held  down  will  move  just  the  lower  right  comer,  allow  ing  resizing  of  the  space  for  the  three 
new  windows.  These  windows  can  also  be  resized  and  moved  after  being  displayed  through  menus  accessible  by 
mousing  right  on  their  top  label  lines. 

Following  the  usual  KREME  conventions,  mousing  right  on  either  the  icon  or  name  for  a  concept  or  role  pops 
up  a  menu  of  operations  on  that  object;  one  of  those  options  in  turn  requests  a  menu  of  KNACQ  operations. 
Concepts  can  be  grabbed  in  this  way  from  the  graph  or  the  description  windows:  roles  are  most  easily  accessed  in 
the  list  of  slots  window,  but  note  that  the  mouse  must  be  on  the  name  of  the  role  itself,  and  not  on  the  whole  row  m 
the  table. 

While  in  KNACQ  mode,  the  KREME  display  is  modified  to  indicate  the  presence  of  KNACQ  data  attached  to 
roles  and  concepts,  to  make  it  easier  to  see  when  the  domain  model  has  been  fully  covered  by  KNACQ  data  For 
concepts,  the  necessary  KNACQ  data  is  limited  to  natural  language  terms  that  can  be  used  to  refer  to  instances  of 
that  concept;  an  elliptical  icon  is  used  for  concepts  that  have  such  KNACQ  vocabulary  defined,  while  the  normal 
KREME  rectangular  icon  is  used  for  concepts  with  no  vocabulary'.  Roles  may  carry  not  only  vocabulary 
information  refemng  to  their  attnhuuve  use  but  also  the  more  general  caserule  specifications  for  other  kinds  of 
natural  language  reference  to  the  given  role  and  specifications  for  gradeable  adjectives  based  on  the  role. 
Information  on  the  KNACQ  data  attached  to  roles  is  included  in  the  list  of  roles  (called  "slots"  in  KREME 
parlance).  Columns  there  show  the  number  of  vocabulary  items  defined  as  attribute  names  for  the  role,  the  number 
of  adjectives,  and  whether  or  not  caserule  specifications  are  defined.  While  the  list  of  slots  window  shows  all  slots, 
both  those  locally  defined  at  this  concept  and  those  inherited  from  a  superconcept,  the  columns  showing  what 
KNACQ  data  has  been  defined  only  apply  to  slots  that  are  defined  locally  To  see  that  display  for  inherited  slots, 
move  up  instead  to  editing  the  concept  at  which  they  are  defined. 

A  user  wishing  to  enter  KNACQ  data  for  a  new  domain  model  may  choose  to  do  so  in  any  order.  One 
reasonable  approach  is  to  move  from  concept  to  concept  through  the  network,  adding  both  concept  vocabulary  for 
each  concept  and  role  information  for  each  of  the  roles  attached  to  it,  working  in  this  w  ay  through  all  the  concepts. 
Concepts  not  yet  handled  are  obvious,  since  they  display  as  rectangular-shaped  icons  Mousing-nght  on  a  role  in  the 
list  of  slots  window  bnngs  up  a  menu  of  commands  to  add  to  or  modify  its  KNACQ  data.  As  an  alternative,  the 
command  "KNACQ  process  concept"  in  the  KNACQ  menu  for  concepts  will  go  through  the  steps  for  acquinng 
first  vocabulary  for  the  concept  and  then  attribute,  caserule,  and  adjective  information  for  all  the  roles  attached  at 
that  concept. 

Because  the  slots  display  window  shews  counts  of  the  KNACQ  data,  its  contents  are  normally  redisplayed 
each  time  the  user  adds  to  or  changes  the  KNACQ  data  for  a  slot.  The  same  holds  for  the  window  where  the 
definition  of  the  current  concept  is  displayed,  since  it  displays  the  now  changed  internal  form  of  the  KNACQ  data  as 
user  attached  data  If  this  redisplaying  proves  annoyingly  slow,  setting  the  flag  kui::*refresh-p*  to  nil  will  cause 
the  slot  s  window's  contents  to  be  refreshed  only  wlien  the  entire  window  is  redisplayed 
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The  following  sections  describe  in  more  detail  the  different  classes  of  data  KNACQ  requires. 

7.2. 1  Terms  for  Concepts 

For  each  concept.  KNACQ  can  record  a  set  of  terms  (usually  common  nouns)  that  can  be  used  to  refer  to 
entities  belonging  to  that  class.  For  example,  the  concept  “destroyer-class-vessel”  might  cany  KNACQ  data  noting 
that  any  instance  of  that  class  can  be  referred  to  as  a  “destroyer”  and  perhaps  also  as  a  “tin  can"  (Navy  slang). 
Because  NIKL  inhentance  also  specifies  that  “destroyer-class-vessel"  is  a  subclass  of  “vessel",  KNACQ  realizes 
that  terms  like  “ship”  and  “boat”  that  are  attached  to  the  concept  “vessel”  can  also  apply  to  “destroyers”. 

For  each  such  term,  KNACQ  also  records  inflectional  information  showing  how  it  can  be  made  plural 
KNACQ  will  accept  a  set  of  vocabulary  items,  terminated  by  an  empty  carriage  return,  and  it  then  quenes  for  the 
inflection  of  each  item.  Setting  the  flag  kui::*plural-query-p*  to  nil  w  ill  suppress  the  user  queries  about  inflection, 
leaving  KNACQ  to  depend  on  its  own  heunstics  for  guessing  the  correct  plural.  KNACQ  allows  the  use  of  multiple 
word  entries,  like  “tin  can  for  destroyer,  althougn  me  facility  is  not  fully  general,  since  it  assumes  that  inflection 
affects  only  the  last  word  in  the  sequence. 

7.2.2  Terms  for  Attribute  Roles 

The  pnmary  information  stored  by  KNACQ  about  roles  is  tne  vocabulary  used  to  describe  their  use  a.\ 
attnbutes.  For  example,  the  “speed-of”  role  connecting  “vessels”  to  “speeds”  can  be  referred  to  by  the  attribute 
terms  “speed”  and  “velocity”. 

Note  that  KNACQ  information  can  be  attached  to  slots  either  (as  is  typically  done)  at  their  defining  concepts 
or  (when  desired)  at  a  concept  that  merely  inhents  the  slot.  The  latter  is  appropnate  when  an  Nl.  temi  can  be  used 
to  descnbe  a  slot  only  in  relation  to  a  particular  class  of  entities.  For  example,  while  all  “physical-objects”  might 
be  defined  to  have  a  “width"  slot,  the  term  “beam"  is  used  to  descnbe  that  slot  only  when  speaking  of  ships:  thus 
“beam”  would  be  attached  as  a  KNACQ  temi  for  the  “width"  slot  at  the  concept  “vessel".  Because  there  is  this 
option  of  attaching  KNACQ  information  to  slots  at  concepts  that  inherit  rather  than  define  them.  KNACQ  operations 
on  slots  first  ask  for  a  concept  name  to  specify  the  concept  with  respect  to  which  this  term  is  to  be  used.  The  default 
answer  for  that  question  is  always  the  defining  concept,  which  is  usually  also  the  appropnate  answer. 

With  the  related  concept  established,  the  user  can  then  enter  attribute  terms  for  the  slot.  Inflectional 
mformaUon  is  collected  for  these  attribute  terms,  in  the  same  way  as  for  concept  vocabulary. 

KNACQ  also  makes  it  possible  to  record  families  of  attribute  names,  using  “attnbute  modifiers",  for  cases 
where  adjectives  modifying  the  name  cause  it  to  refer  to  a  different  role.  For  example,  “speed"  by  itself  might  act 
as  a  nanw*  for  »h#>  role  “ vessel -speed-cf”  while  the  modified  names  “maximum  speed"  and  “design  speed"  act  as 
names  for  the  separate  rules  “vessel-maximum-speed-of '  and  '  vessel-design-speed-of".  KNACQ  allows  such 
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modified  roles  to  be  entered,  along  with  the  roles  to  which  they  actually  refer.  (Note,  however,  that  the  modifying 
words  like  "maximum"  or  "design"  need  to  be  themselves  defined  syntactically  as  words  for  this  facility  to  work. 
KJSACQ  was  not  designed  to  do  this  automatically  because  it  was  not  clear  that  their  syntactic  type  could  be  safely 
guessed  in  all  cases.  Also  note  that  such  modifiers  are  stored  at  the  role  named  by  the  main  attribute  term, 
"vessel-speed-of  in  the  example,  and  not  at  the  roles  that  the  modified  attributes  end  up  referring  to.) 

7.2.3  Caseframe  Patterns  for  Roles 

While  the  attribute  style  covers  a  large  percentage  of  the  ways  in  which  role  data  can  be  referred  to  in  NL,  a 
more  general  facility  in  KNACQ  known  as  "caseframe  rules"  allows  the  system  developer  to  select  from  a  menu 
exactly  the  appropnate  set  of  prepositional  phrases  or  nominal  compounds  by  which  the  given  role  can  be 
referenced  One  example  requiring  the  more  general  caseframe  approach  was  given  above,  where  "the  repon  from 
unit-12"  did  not  fit  the  pattern  of  "the  report's  unit".  In  that  example,  the  role  as  defined  in  the  domain  model  is  in 
the  opposite  direction  from  the  way  it  is  normally  referred  to.  Thus,  we  are  presuming  here  that  the  domain  model 
includes  a  role  from  casualty  reports  to  the  ship  that  filed  them.  Attribute  names  could  be  used  with  that  role  to 
handle  references  like  "the  ship  of  report-1 12".  However,  in  examples  like  "the  casualty'  report  of  the  Fox"  the 
attnbute  rule  approach  would  take  the  vessel  as  the  main  noun,  though  the  role  here  is  attached  to  the  report.  Thus 
caseframe  rules,  which  can  apply  in  either  direction,  must  be  used  in  order  to  connect  phrases  like  that  with  the 
underlying  role  from  reports  to  ships. 

Adding  a  caseframe  rule  is  one  of  the  options  on  the  menu  attached  to  roles.  Mousing  that  option  brings  up  a 
substantial  menu  of  possible  forms  of  reference  to  the  given  role.  For  example,  if  the  role  were  the  one  connecting 
ships  and  speeds,  the  menu  would  include  the  noun  phrases  "the  ship  speed"  and  "the  speed  ship"  along  with 
many  possible  prepositional  phrases,  including  "the  ship  among  the  speed"  and  "the  speed  among  the  ship",  "the 
ship  upon  the  speed"  and  "the  speed  upon  the  ship",  and  the  like.  Most  of  these  possible  expressions  will  not 
actually  be  ways  of  referring  to  that  role  in  English,  but  some  will,  and  the  user  should  select  the  ones  that  both 
sound  nght  and  could  reasonably  be  interpreted  as  referring  to  the  given  role.  For  prepositional  phrases,  the 
KNACQ  menu  also  makes  a  distinction  between  those  that  can  only  occur  as  part  of  the  main  NP  and  those  that  can 
also  occur  as  a  predicate  As  an  example  of  the  former  type,  the  phrase  "the  commander  of  Fredenck"  works, 
while  "the  commander  is  of  Fredenck"  does  not:  as  an  example  of  the  latter,  ‘‘the  Admiral  aboard  the  ship  ‘  and 
"the  Admiral  is  aboard  the  ship"  are  both  OK 
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7.2.4  (iradeable  Adjective  Terms 

Certain  domain  model  roles  like  the  speed  of  a  vessel  refer  to  scalar  functions  where  the  values  of  that  role  tor 
different  instances  of  the  concept  can  he  compared  like  grades  on  a  scale.  KNACQ  allows  the  developer  to  enter 
adjectives  that  can  refer  to  such  roles,  like  the  adjective  “fast"  for  speed.  For  each  gradeable  adjective.  KNACQ 
asks  for  the  root  form,  the  inflectional  information  (“fast,  faster,  fastest"),  and  who  ner  “X  is  faster  than  Y"  means 
that  X's  value  for  this  role  is  greater  than  or  less  than  Y’s. 

Such  gradable  adjectives  can  also  be  used  in  an  absolute  sense,  as  in  the  phrase  "a  fast  ship".  KNACQ 
implements  a  straightforward  scheme  whereb  the  developer  car.  enter  a  single  threshold  value  for  each  adjective 
stating  the  limit  above  or  below  which  instances  of  that  concept  or  subconcepts  can  be  said  to  satisfy  such  absolute 
uses  of  the  given  adjective.  The  user  ma\  enter  more  than  one  sense  of  "fast  ',  with  different  concepts  as  their 
domain  to  indicate,  for  example,  that  "a  fast  battleship  '  is  faster  than  a  fast  landing  craft' . 


7.3  Making  KN  ACQ  Data  Available  in  IRISH 


KNACQ  stores  the  information  it  learns  about  NL  ways  of  referring  to  particular  concepts  and  roles  bv 
attaching  data  to  the  DATA  slot  of  the  concepts  and  roles.  When  the  user  then  saves  the  network,  the  KNACQ  data 
is  saved  along  with  the  normal  N1KL  definitions  of  the  concepts  and  roles.  (If  the  network  is  reloaded  into 
KREME/KNACQ  in  a  later  session,  this  attached  data  is  used  to  reestablish  the  state  that  existed  when  the  network 
was  last  saved.)  When  this  annotated  form  of  the  NIKL  network  is  loaded  into  IRUS-II.  this  attached  data  is  loaded 
along  with  it  The  function  iprocess-domain-model-knacq-datai  then  uses  that  attached  data  to  denve  IRUS-II 
syntactic  and  sematic  entries  for  words  in  the  IRUS-II  dictionary. 

This  process  ot  transferring  data  acquired  b\  KNACQ  to  IRUS-II  currently  works  only  for  the  entire  domain 
model  at  once  Further  changes  made  in  KNACQ  will  not  become  effective  in  IRUS-II  until  the  next  time  the  entire 
network  is  saved  out  from  KNACQ  and  reloaded  into  IRUS-II. 
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8.  Access  to  Multiple  Underlying  Systems  in  Janus 


8.1  Introduction 


As  computer  systems  become  more  complex,  there  is  more  opportunity  for  combining  the  strengths  of  more 
than  one  system  in  order  to  perform  a  task.  For  example,  one  might  imagine  combining  several  resources:  a 
database  for  storing  relational  information  with  an  applications  program  to  perform  calculations  based  on  that 
information,  an  expert  system  to  perform  inferences,  and  a  display  system  to  present  data  in  a  useful  way.  In  such 
an  environment  a  "seamless”  natural  language  interface  can  become  a  very  effective  tool,  allowing  the  user  to 
retneve  and  manipulate  information  without  needing  to  pay  attention  to  the  details  of  any  particular  resource. 

The  back-end  of  such  an  interface  however,  is  necessarily  more  complex:  not  only  must  it  be  able  to  translate 
the  user's  request  into  executable  code,  but  it  must  also  be  capable  of  organizing  the  vanous  resources  at  its 
disposal,  choosing  w  hich  combination  of  resources  to  use,  and  supervising  the  transfer  of  data  among  them.  We  call 
this  the  multiple  underlying  system ,v  (MUS*  problem.  This  chapter  describes  the  critical  information  needed  for  the 
implemented  MJS  component  that  is  pan  of  the  back  end  of  the  Janus  natural  language  interface. 1 


8.2  The  Type  System 


The  syntax  of  WML  is  "modelled  after  languages  of  the  typed  lambda  calculus"  (  [10].  p.  27).  The  import  of 
this  is  perhaps  most  concisely  expressed  in  (29|: 

Each  type  expression  [or  type)  is  associated  with  a  set  of  entities  or  structures  which  is  termed  its  Jonuitn.  Even 
expression  of  ,e  language...  can  be  mapped  to  a  type  expression,  whose  domain  serves  to  delimit  the  range  of  values 
the  expression  can  take  on 

A  complete  description  of  the  type  system  associated  w  ith  WML  is  beyond  the  scope  of  this  document,  but  in  this 
section  we  attempt  to  convey  a  sense  of  it  by  means  of  examples. 

The  set  of  opes  has  two  major  subsets  —  those  that  are  independent  of  the  domain,  and  those  that  are  specific 
to  the  domain  The  former  set  includes  the  types  TV  (truth- value).  INTEGERS.  STRINGS.  REALS.  TIMES,  and 
WORLDS;  the  latter  consists  of  types  constructed  trom  concepts  and  roles  in  the  domain-model  For  example,  if 
VESSEL  and  LOCATION'  are  domain-model  concepts  and  SHIT-LOCATIOS  and  COMBAT-READY  are  domain 
roles,  then  some  related  types  might  include: 


Hir  h.ip'-r  ! '  *■-  JM-.  ";t  *hr  in*  'rn.iti  Ti  'hi*  nniM  hr-  pr  a  i.lrU  in  .ipplv  mg  tilin',  to  .1  nf  w  .ippli,  .iti-.n  .iini  is  jn  .ihhrc  .1  iir.l  ■.  crsioii  ot  J  hij 
*  hwh  pr»’-  k1"n  uii 'r'*  A  t.iilf  i  urTic:. !.»!.« mi  ■  »t  ;hc  M  S  v  iriupnficnt 
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Denotation 


VESSEL 
(S  VESSEL) 

(TUPLE  VESSEL  LOCATION) 

(S  (TUPLE  VESSEL  LOCATION) ) 
(rUNC-TYPE  VESSEL  LOCATION) 
(FUNC-TYPE  VESSEL  TV) 


all  vessels 

all  sets  of  vessels 

all  ordered  pairs  of  vessel,  location 
all  sets  of  such  ordered  pairs 
functions  from  vessels  to  locations 
unary  predicates  on  vessels 


Here  we  present  some  types  together  with  examples  of  logical  subexpressions  having  those  types  t  that  is. 
(TYPEOF  expression )  -  type). 

1  type  VESSEL 

Vincennes 

(IOTA  ?JX1  VESSEL  (COMBAT-READY  ?JX1)) 

2.  type  (S  VESSEL) 

(SETOF  Vincennes  Kennedy) 

(SET  ?JX2  VESSEL  (COMBAT-READY  ?JX2) ) 

(IOTA  ?JX1  (POWER  VESSEL)  (COMBAT-READY  ?JX1) ) 

type  (FUNC-TYPE  VESSEL  LOCATION) 

SHIP-LOCATION 

(LAMBDA  (?JX3)  VESSEL  (VESSEL-LOCATION  ?JX3) ) 

type  (FUNC-TYPE  VESSEL  TV) 

COMBAT -READY 
(LAMBDA  (?  JX4 )  VESSEL 

(AND  (EQUAL  (VESSEL-LOCATION  ?JX4)  "HAWAII") 

(COMBAT-READY  ?JX4) ) ) 


8.3  Normalizing  WML  Expressions 

WML  input  expressions  are  simplified  and  normalized  before  they  are  further  processed  by  the  multiple 
underlying  systems  (ML'S)  component  This  simplification  process  has  five  stages. 

1  the  extensionali/ation  of  intensional  subexpressions  (for  further  information,  see  [20] ). 

2.  the  translation  of  the  entire  expression  into  a  modified  disjunctive  normal  form. 

3.  the  elimination  of  unnecessary  equivalences  ( for  further  information,  see  [20| ). 

4.  the  application  of  underlymg-system-independeni  rewrites,  and 

5.  the  use  of  printfunctions  for  improving  responses. 

These  stages  occur  sequentially,  and  --  as  the  system  is  currently  implemented  --  must  he  done  in  the  order  presented 
here  Stages  2.  4,  and  5  are  discussed  further  below 
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8.3.1  Disjunctise  Normal  Form 


The  second  stage  of  normalization  is  the  translation  of  the  extensionahzed  WML  expression  into  a  some*  hat 
simplified  logical  expression  in  a  modified  disjunctive  normal  form  (DNF). 

The  expression  is  translated  into  a  disjunctive  normal  form  for  two  main  reasons  We  normalize  the 
expression  (reducing  the  number  of  embedded  subexpressions,  for  example)  in  order  to  simplify  the  process  of 
matching  vanous  pieces  of  it  to  underlying  system  capabilities.  We  choose  to  use  a  disjunctive  nonnal  form 
because: 

•  In  the  simplest  case,  an  expression  in  disjunctive  normal  form  is  simply  a  conjunction  of  clauses,  a 
particularly  easy  logical  form  to  cope  with. 

•  Even  when  there  are  disjuncts,  each  can  be  individually  handled  as  a  conjunction  of  clauses,  and  the 
results  then  combined  together  via  union,  and 

•  Bringing  disjunctions  to  the  top  level  allows  patterns  to  match  in  many  cases  where  it  would  otherwise 
not  be  possible.  For  example,  given  the  (non-normalized)  pattern 

(AND  (OR  (IN. CLASS  ?JX1  SUBMARINE) 

(IN. CLASS  ?JX1  AIRCRAFT) ) 

(LENGTH  ?.JX1  ? JX2)  ) 

a  service  seeking  to  match  the  pattern 

(AND  (IN. CLASS  <x>  SUBMARINE) 

(LENGTH  <x>  <y>) ) 

could  not  match.  The  DNF.  on  the  other  hand, 

(OR  (AND  (IN. CLASS  ?JX1  SUBMARINE) 

(LENGTH  ?JXI  ? JX2 ) ) 

(AND  (IN. CLASS  ?JX1  AIRCRAFT) 

(LENGTH  ? J5C1  ?JX2))) 

allows  the  match  to  take  place,  by  keeping  the  relevant  information  together  In  a  disjunctive  normal 
form,  each  disjunct  effectively  carries  all  the  information  necessary  for  a  distinct  subquery 


A  standard  disjunctive  normal  form  is  a  disiunction  of  conjunctions  of  predicates  or  negated  predicates:  no 
variables  in  such  an  expression  are  explicitly  quantified,  and  all  are  assumed  to  he  implicitly  universally  quantified. 
Existentially  quantified  vanables  have  been  replaced  by  skolem  terms  denoting  some  individual  instantiation  of  the 
van  able. 


The  modified  disjunctive  normal  form  differs  from  a  standard  DNF  in  several  respects: 

•  There  is  a  response  clause  for  every  query:  that  is.  an  additional  predicate  whose  arguments  are  the 
variables  for  which  we  want  returned  values.  In  a  query  requesting  a  set  of  objects  te.g.  "Which  ships 
are  in  the  Indian  Ocean'.’"  >  the  argument  in  the  response  clause  w  ill  be  the  variable  denoting  the  set  in 
question  The  same  is  true  for  queries  requesting  individuals  (e  g  the  ship  whose  speed  is  30  knots  r 
the  resulting  logical  form  will  seek  a/' possible  individuals  that  meet  the  same  description. 

In  a  yes/no  or  existential  query  the  response  clause  will  contain  all  variables  in  the  query,  since  any 
instantiation  of  all  the  queiy  variables  means  an  affirmative  answer:  the  inability  to  find  any  such 
instantiation  means  an  answer  in  the  negative  In  such  cases,  a  particular  variation  on  the  response 
predicate  is  used  rather  than  using  the  special  predicate  KESPO,\SE.  we  use  the  special  predicate 
\  ALl  r.-r.XISIS-Krsro\SL  instead,  this  preserves  the  inhumation  that  the  query  "s  intent  is  to  find  out 
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if  values  exist,  rather  than  to  have  them  relumed.2 

•  All  functional  terms  must  appear  as  predicates:  if  P  is  a  binary  predicate  and  Q  is  a  unary  function,  then 
Pfx,  Q(y)i  must  appear  as  P(x.  z)  and  Q'(y.  z>,  where  Q'(y.  z)  is  true  iff  Q(y)=z.  For  example,  the 
clause 

(GREATER-THAN  ^EED-OF  ?uA1/  30) 
will  appear  as 

([AND]  (SPEED-OF'  ?JX1  ?JX2) 

(GREATER-THAN  ?JX2  30)) 

•  Similarly,  provision  is  made  for  complex  terms  like  database  aggregates;  for  example,  cardinality, 
average,  and  sum.  Such  complex  terms  may  only  appear  as  the  first  argument  to  a  special  predicate 
called  IS-TERM,  the  second  argument  is  always  a  variable  that  represents  the  term.  For  example,  if  the 
logical  expression  asks  for  the  cardinality  of  the  ships  in  the  Indian  Ocean,  we  would  use  the  following 
clause: 

(IS-TERM  #S (CONTEXT 

: OPERATOR  CARDINALITY 
: OPERATOR-VAR  ?  JX2 
; CLASS-EXP 

((IN. CLASS  ?JX2  SHIP) 

(SHIP -LOCATION  ?JX2  "INDIAN  OCEAN"))) 

?JX1) 

(RESPONSE  ?JX1) 

•  There  is  no  implicit  assumption  of  universal  quantification  for  unquantified  variables  --  expressions  in 
the  modified  DNF  may  contain  universal  quantification. 

•  Existential  quantifiers  are  removed,  not  by  replacing  existentially  quantified  vanables  with  skolem 
terms,  but  simply  by  removing  the  explicit  existential  quantification.  The  resulung  unquantified 
variables,  along  with  all  other  unquantified  variables  in  the  form,  are  considered  to  be  query  quantified. 

The  term  query  quantified  refen  to  vanables  for  which  we  would  like  to  get  all  possible  instantiations. 
Such  vanables  are  neither  existentially  quantified  (since  we’re  interested  in  all  instantiations)  nor 
universally  quanufied  (since  universal  quantification  has  no  notion  of  returning  values);  this  kind  of 
quantification  is  more  like  that  of  the  variables  in  a  PROLOG  expression. 

Notice  that,  because  universal  quantifications  are  net  removed,  there  is  no  need  to  skolemi/e 
existentially  quantified  vanables  appearing  within  the  scope  of  universal  quantifiers. 


8.3.2  System-independent  Rewrites 

At  this  stage  of  the  normalization  process,  the  system  permits  the  application  of  obligatory  rewrite  rules 
These  rules  must  be  independent  of  the  underlying  systems:  both  pattern  and  result  must  consist  of  domain-model 
tnfonnauon,  and  they  may  not  contain  any  references  to  structures  or  data  in  the  underlying  system)  s). 

Rewrite  patterns  may  seek  to  match  both  simple  clauses  ( i.e. .  those  that  are  not  contexts),  and  context-clauses 
Similarly,  results  may  be  either  contexts  or  simple  clauses.  For  example,  the  following  rewrite  might  be  used  if  it 
was  known  that  the  number  of  subordinates  of  a  manager  corresponded  to  the  number  ot  employees  in  a  manager’s 
department 

“A  cooperative  system  may  still  return  the  vjjuc  thy  exist;  !>ir  example.  Are  any  ships  Cl  might  lead  t<»  the  response  )\s  the  (7  sh.p. 
arc 
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(def ine-aimple-rewrite3 

: pattern  ((in. class  x  manager) 

( : context 

: operator  CARDINALITY 
:class-exp  ((in. class  y  person) 

(subordinate-to  y  x) ) ) ) 

.-result  ((in.  class  x  manager) 

(department -of  x  z) 

(employee-count  z  y) ) ) 

By  using  this  rewrite  rule,  we  transform  a  query  in  which  one  actually  counts  elements  in  a  set  (via  the 
cardinality  term)  into  one  in  which  a  single  table  lookup  is  used  instead. 

8.3.3  Printfunctions 

Often  the  logical  content  of  a  query  does  not  reflect  its  desired  interpretation.  For  example,  a  query  as  simple 
as  "List  the  cruisers,"  if  interpreted  literally,  produces  a  listing  of  the  database's  internal  representation  for  each 
cruiser.  In  the  Navy's  IDB  domain,  this  representation  is  a  number  called  an  IUED  -  a  number  that  is  almost  certain 
to  be  completely  useless  to  the  user  as  a  means  of  ship  identification.  What  one  would  really  like  is  for  the  system 
to  be  smart  enough  to  inteipret  the  question  as  "List  the  names  of  the  cruisers."  Printfunctions  provide  just  that 
functionality. 

The  pnntfunction  machinery  is  quite  simple.  With  certain  classes  of  objects  (e.g..  the  domain-model  concept 
VESSEL)  one  associates  a  specification  for  how  members  of  that  class  should  be  presented  to  the  user,  called  a 
printfunctions  list.  Each  element  of  the  printfunctions  list  (each  printfunction )  is  either  (1)  the  name  of  a  domam- 
model  role,  or  (2)  the  special  symbol  IDENTITY  As  a  postprocessing  step  of  the  normalization,  the  variables  on 
the  response  list  are  examined,  and  a  nev.  response  list  created. 

Printfunctions  are  inherited  -  in  the  examples  in  the  following  section,  responses  involving  ship  classes  like 
cruisers  and  aircraft-carriers  are  always  expressed  as  responses  involving  the  names  of  the  ships  because  the  class 
VESSEL  (the  top-level  class  for  ships)  has  the  pnntfunctron  list  tNAMEOF)  associated  with  it. 


In  simple  rewrites,  x.  s.  z  and  v.  ar?  always  .arubies 
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8.4  Servers  and  Services 


In  an  environment  with  multiple  underlying  systems,  one  must  have  a  uniform  way  to  describe  the  capabilities 
of  each  underlying  system.  We  adopt  terminology  similar  to  that  of  [12]  and  [17]. 

A  server  is  a  functional  module  typically  corresponding  to  an  underlying  system  or  a  major  pan  of  an 
underlying  system.  In  the  application  of  the  MUS  system  being  described  here,  there  are  two  servers  —  one  named 
:ERL,  which  supports  access  to  a  relational  database,  and  one  called  :LISP,  which  supports  calls  to  arbitrary  LISP 
functions.  Each  server  has  associated  with  it: 

1.  A  number  of  senices :  objects  describing  a  particular  piece  of  functionality  provided  by  a  server. 
Specifying  a  service  in  MUS  provides  the  mapping  from  fragments  of  logical  form  to  fragments  of 
underlying  system  code 

2.  An  execution  planner,  a  function  that  takes  a  piece  of  the  solution  to  a  query  and  builds  from  it  a 
partial  execution  plan 

3.  An  executor :  a  function  that  takes  a  partial  execution  plan  together  with  input  data,  executes  the  plan, 
and  produces  output  data  (see  section  8  5  j. 

A  service  is  an  object  consisting  of  the  following  components: 

•  Name:  a  symbol  used  to  uniquely  identity  the  sen  ice 

,  •  Owner:  lame  of  the  server  to  which  this  service  belongs. 

•  Cost:  a  scalar  value  indicating  the  cost  of  this  sen  ice;  if  unspecified,  unit  cost  ( 1 )  is  assumed. 

•  Inputs:  a  list  of  pattern  vanables,  each  of  which  has  associated  w  ith  it  a  name,  a  type,  and  a  constraint 
The  type  indicates  the  extent  to  which  the  input  is  optional:  a  type  of  :GEN  indicates  that  input  to  this 
variable  is  optional,  since  this  semce  can  generate  values  for  the  variable,  a  type  of  TEST  indicates 
that  input  must  be  provided  for  the  variable,  since  this  service  is  only  capable  of  applying  some  test  to 
the  input  values;  a  type  of  :TEST-ALL  indicates  not  only  that  input  to  this  vanable  is  obligatory  ,  but  that 
by  the  time  the  data  for  this  variable  reaches  this  service  it  must  be  filtered  as  completely  as  possible  - 
this  is  often  the  type  for  inputs  to  services  that  do  response  presentation,  for  example. 

The  constraint  associated  with  the  vanable  is  used  for  pattern-matching.  The  possible  constraints 
include 

1.  (symbol* ):  a  list  of  symbols.  Items  matching  this  vanable  must  be  EQ  to  a  symbol  on  the  list. 

2.  (string*)-,  a  list  of  strings.  Items  matching  this  vanable  must  be  STRING=  to  a  string  on  the 
list. 

3.  type',  a  simple  type  .Am  item  will  match  this  variable  if  the  type  (i.e.,  type-system  type  -  see 
section  S.2 )  of  the  item  is  a  subty  pe  of  npe.  (The  type  of  a  Janus  variable  v  is  type  if  the  clause 
t/H. CLASS  \  type:  appears  in  the  (normalised)  query.)  This  constraint  does  not  pay  attention  to 
whethr  or  not  a  type  denotes  a  set  —  if  npe  is  (S  SHIP)  (a  set  of  ships),  an  item  w  ith  type  SHIP 
will  match,  and  vice-versa. 

4.  function:  a  function  than  takes  one  argument  An  item  item  w  ill  match  this  vanable  if  (funcatl 
function  item)  returns  a  nnn-NJL  value. 

5.  (SUBTYPE. -OF  t\pe>:  a  subtype  specification  Items  matching  this  vanable  must  themselves  he 
types  in  the  npe  system:  furthermore,  they  must  be  subtypes  of  type.  For  example,  a  variable 
with  constraint  (SUBTTPE-t)F  SHIP)  would  match  CRUISER  (since  CRUISER  is  a  subtype  of 
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SHIP).  Notice  how  this  differs  from  (?).  above:  there  the  constraint  is  that 

(SUBTYPE  fTYPEOF  itemt  type)  must  be  true,  whereas  here  the  constraint  is  that 
(SUBTYPE  item  type)  must  be  true. 

6.  NIL,  ANYTYPE.  T.  these  will  match  anything. 

•  Outputs:  a  list  of  pattern  variables,  identifying  the  outputs  of  the  service.  Outputs  need  not  have  been 
inputs,  nor  must  inputs  to  the  sen  ice  also  be  outputs. 

•  Pattern:  a  pattern  specification  which  will  match  some  piece  of  the  logical  form.  The  pattern 
specification  must  be  a  list,  each  element  of  which  is  the  pattern  for  either  a  simple  clause  or  for  a 
context-clause.  Within  patterns,  one  can  not  have  a  variable  predicate:  however,  the  arguments  to 
predicates  must  be  pattern  variables  (see  inputs,  above,  for  a  description  of  how  to  constrain  what  these 
variables  may  match). 

A  pattern  specification  for  a  context-clause  ( context-spec )  takes  one  ot  two  forms. 

( : contaxfc  .‘operator  operator 

[ :  free-vars  (var*)  ] 

[  : operator-var  rur] 

[:stat-var  vur] 

[:  class-exp  expression] 
t :  constraints  expression J) 

For  this  form  of  pattern  specification,  the  context-spec's  operator  must  match  the  context's  operator, 
and  recursive  calls  to  the  matcher  must  return  successfully  for  the  :dass-exp  and  xonstraints. 

(•.context  :  operator  operator 

[ :  covers-owner  senermame ] ) 

This  second  form  of  pattern  specification  allows  one  to  say.  "This  service  will  match  any  context  whose 
operator  is  operator,  as  long  as  there  are  solutions  of  the  :class-exp  subexpression  and  of  the 
constraints  subexpression  such  that  both  solutions  belong  entirely  to  server  sener-name."  For 
example,  a  context-spec  for  operator  CARDINALITY’  specifying  that  it  covers  owner  :ERL  says,  in 
effect,  "This  service  can  take  the  cardinality  of  any  set,  as  long  as  that  set  can  be  obtained  entirely  by 
calls  within  the  :ERL  server."  This  is  a  useful  method  of  providing  general  services  that  handle 
aggregate  operations  w  ithin  a  single  server 

•  Method:  a  code  fragment  or  other  information  that  the  server  will  use  in  generating  a  partial  execution 
plan  from  a  solution  that  utilizes  tin.,  service.  What  goes  in  the  method  slot  depends  entirely  cn  the 
particular  server  to  which  the  service  belongs. 


For  fast  access,  services  are  indexed  by  the  predicates  in  their  pattern.  That  is.  for  every  clause  tP  x  w  in  the 
pattern  of  some  service  S,  there  is  a  pointer  from  the  symbol  P  to  the  serv  ice  S.  An  excepuon  to  this  is  the  IN. CLASS 
predicate:  if  a  service’s  pattern  includes  (IN. CLASS  x  C).  the  pointer  w,U  be  from  the  symbol  C  rather  than  the 
symbol  IN.CL\SS:  that  ts.  the  indexing  proceeds  as  if  the  clause  were  G  w. 

As  an  example,  consider  the  service-object  corresponding  to  the  ERL  server’s  ability  to  access  a  table 
associating  ships  with  overall  combat-readiness  \alues: 
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NAME: 
OWNER: 
INPUTS : 
OUTPUTS : 
PATTERN : 
COST: 
METHOD : 


VESSEL-OVERALL-READINESS -OF8 5  9 
:  ERL 

«x>  <y» 

<<*>  <y>) 

( (VESSEL -OVERALL- READINESS -OF  <x>  <y>) ) 
NIL4 

(  (  (VESSEL-OVERALL-READINESS-OF  X  Y) ) 
(BINDTOERL  ((X  IUID)  (Y  RDY) )  IID.RDY)) 


The  name  and  owner  fields  are  straightforward:  the  service  has  a  unique  name  and  belongs  to  the  server 
named  :ERL  (in  a  current  implementation,  :ERL  is  the  server  that  can  access  the  Navy's  relational  database).  The 
pattern  is  also  particularly  simple,  a  single  clause.  Note  that  the  variables  printed  as  <x>  and  <y>  are  objects: 

NAME:  X 

TYPE :  : GEN 

CONSTRAINT :  VESSEL 


NAME:  Y 

TYPE :  : GEN 

CONSTRAINT:  READINESS-RATING 

Because  both  are  type  :GEN,  this  service  does  not  require  input  values  for  these  variables  The  pattern  will  match 
clauses  only  when  the  type  of  the  first  argument  (matching  <x>)  is  VESSEL,  and  the  type  of  the  second  argument 
(matching  <y>)  is  READINESS-RATING. 


The  method  field  for  services  belonging  to  the  :ERL  server  contains  two  pic'^-  fity.,  uie  pattern  mat  was 
matched;  second,  a  code-like  fragment  that  relates  variables  to  ficlua  and  specifies  a  table  from  which  to  draw  those 
fields.5 


The  scheme  for  indexing  services  establishes  a  pointer  from 
VESSEL-OVERALL-READINESS-OF  to  this  service. 


the  symbol 


8.5  Execution 


The  execution  phase  takes  an  execution  plan  (i.e..  a  list  of  partial  execution  plans),  and  iterates  through  it 
sequentially: 

For  each  partial  execution  plan  p 

Combine  the  data  from  the  streams  in  the  wrapper  of  p 
Call  the  execution  function  for  the  owner  of  p 
Pass  the  output  tuples  (according  to  the  dataflow  links  of  p) 
into  the  wrapper  objects  of  partial  plans  further  on 
Return  the  output  provided  by  the  last  partial  execution  plan 


1The  cost  field  is  unspecified,  therefore  this  scnne  is  assumed  (*>  have  unit  cost. 
i"h  is  is  a  simp  lit  ic  it  ion  the  t  ahle  spec  ills  am  in  m,i\  he  a  lr  lament  ol  f.R!  .ode.  complete  ss  ith  JOINs.  SI  I  ft  *1  .  rt. 
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The  execution  tunc  aon  (or  executor )  for  a  server  is  a  function  taking  the  following  arguments: 

1.  A  List  of  tuples  representing  input  values, 

2.  A  sequence  of  Janus  variables  identifying  the  tuple  elements,  and 

3.  Code  produced  by  the  execution  planner. 

The  execution  function  should  return  two  values: 

1.  A  list  of  tuples  representing  output  values,  and 

2.  A  sequence  of  Janus  variables  identifying  the  tuple -elements. 


8.6  An  Example  Backend 


Since  relational  databases  are  typical  services,  we  descnbe  one  here  bnefly.  The  previous  generation  of  IRUS 
made  use  of  a  set  of  rules  to  translate  from  MRL  expressions  to  ERL  (extended  relational  language)  expressions 
f  18].  Although  IRUS-II  makes  use  of  WML  rather  than  MRL.  and  although  there  is  now  an  additional  component 
(the  multiple-systems  component)  between  the  logical  form  and  the  underlying  system,  the  MRL-to-ERL  translation 
rules  (MRLRules)  still  serve  much  the  same  purpose  as  before:  they  relate  domain-model  concepts  and  roles  to  the 
relations  in  the  underlying  database.  As  such,  they  are  used  essentially  without  modification  in  IRUS-II 

They  do,  however,  serve  an  additional  purpose.  The  multiple-systems  component  makes  use  of  these  rules  in 
order  to  generate  its  model  of  the  capabilities  of  the  underlying  database.  This  process  has  been  automated,  and 
consists  primarily  of  creating  a  "service"  for  each  rule  that  describes  its  input  and  output  characteristics. 

MRLRules  are  the  mapping  from  the  domain  model  concepts  and  roles  to  Extended  Relational  Language. 
Each  MRLRule  provides  the  IRUS  system  with  the  information  it  needs  to  translate  one  domain  model  object  into 
an  SQL-based  database  access  language. 

The  function  that  installs  one  of  these  rules  is  DEFMl’SRule.  It  has  two  arguments,  a  form  specifying  the 
domain  model  object  you  are  providing  the  translation  for.  and  the  translation. 

(DEFMUSRule  (<dm  node  name>  &rest  args) 

(BindToERL  ((Argl  attl  ...)  (Arg2  ...) ... ) 

<An  ERL  relation>  i  ■ 

The  <dm  node  name>  is  a  symbol  that  has  a  concept  or  role  definition  in  the  domain  model.  The  number  of 
arguments  provided  will  effect  the  ways  that  this  rule  can  be  applied  dunng  translauon.  There  are  generally  2 
arguments.  If  there  is  only  one.  the  rule  can  only  be  used  to  translate  a  single  argument  predicate.  If  there  are  2  or 
more  arguments,  the  N-argument  rule  can  be  used  to  translate  an  N-ary  predicate,  an  (N-lt-ary  function  w  ith  1 
result,  or  an  (N-l  )-ary  predicate.  Binary  predicates  and  unary  functions  Lire  the  most  common  ease 

For  example. 

(DEFMUSRule  ( FOG  a  b)  ( BindToERL  n  a  too  A  i  (b  fooB  I  fooB2  )>  FooTahle.it 
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defines  the  translation  for  the  domain  model  object  named  FOO.  It  enables  the  system  to  translate  FOO  as  a  binary 
predicate  as  in  (and  (foo  vl  v2)  ...)  or  as  a  function  of  one  argument  as  in  (GT  (foo  vl )  v2)  or  as  a  unary  predicate 
as  in  (and  (foo  vl)  ...).  The  first  enforces  the  restriction  that  the  relation  FOO  exists  between  variables  vl  and  \  2. 
The  second  allows  the  system  to  generate  more  complicated  predicates  and  expressions  involving  the  FOO  relation. 
The  third  is  a  class  restriction.  It  ensures  that  the  FOO  relation  exists  between  vl  and  some  other  object.  IRUS  will 
probably  Only  use  one  form  of  expression  for  each  domain  model  object  but  the  MRLRules  allow  translations  of  any 
of  the  three. 


The  example  above  says  that  when  translating  a  predicate  (foo  vl  v2).  vl  is  represented  in  the  database  by  the 
field  fooA  in  the  table  FooTable.  and  v2  is  represented  in  the  database  by  the  fields  fooB  1  and  fooB2  in  the  table 
FooTable  To  enumerate  the  domain  of  the  FOO  predicate,  you  would  project  FooTable  over  fooA.  To  enumerate 
the  range  of  the  FOO  predicate,  you  would  project  FooTable  over  the  set  fooB  1  and  fooB2.  This  is  all  you  need  to 
tell  the  system.  IRUS  then  has  all  the  information  it  needs  to  use  expressions  involving  FOO  in  a  translation 


In  a  Navy  domain,  the  underlying  system  that  extracts  data  from  the  IDB  (integrated  database)  is  called  :ERL. 
For  "List  the  ships",  the  following  WML  expression  is  produced  by  the  front-end. 

(BRING- ABOUT 
( (INTENSION 

(EXISTS  ?JX1  LIST 

(OBJECT. OF  ?JX1  (IOTA  ?JX2  (POWER  VESSEL)  T) ) ) ) 

TIME  WORLD) ) 

The  normalized  expression  produced  by  the  back-end  is: 

(AND  (IN. CLASS  ?JX1  LIST) 

(IN. CLASS  ? JX2  VESSEL) 

(OBJECT. OF  ?JXI  ?JX2) 

(MEMBER  ?JX1  ?JX3) 

(IN.CLASS  ?JX3  (POWER  EVENT)) 

(RESPONSE  ? JX3) ) 

After  rewrites  and  pnntfunctions  have  applied,  the  output  ot  the  MUS  component  is: 

((RESPONSE  ? JX70 ) 

(NAMEOF  ? JX2  ?JX70) 

(IN.CLASS  ? JX2  VESSEL)) 

To  answer  the  question  "List  the  ships",  the  NAMEOF  and  IN.CLASS  relations  are  combined  and  the  result 

is  sent  to  the  response  generator.  The  translation  rules  used  are: 

(DEFMUSRule  (NAMEOF  \  y  > 

(BindToERL  ((X  Il'lDuY  SHIPNAMEi)  IID  UCHAR  o 

(DEFMUSRule  (Vessel  Xi 
(BindToERL  ttX  IUIDm  IID  UCHAR  ) 

Note  that  (IN.CLASS  X  V  essel)  is  equivalent  to  i  Vessel  Xi. 

The  two  ERL  fragments  arc  joined  and  then  optimized.  The  resulting  S(jL  statement  is  "Select  l LTD. 
SHIPNAME  From  I1D  UCHAR  "  The  results  are  then  made  available  to  other  sy  stems  by  placing  them  in  the 
wrapper  tor  the  partial  execution  plan. 
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Appendix  A 

Installation  Instructions 

The  IRUS-II  system  consists  of  two  modules,  called  the  Front  End  and  the  Back  End.  The  Front  End  lakes  an 
English  sentence  as  input  and  returns  a  logical  representation  of  that  sentence  in  the  World  Model  Language 
(WML).  The  Front  End  contains  the  grammar,  lexicon,  interpretation  rules,  and  domain  model. 

The  Back  End  takes  a  WML  as  input  and  transforms  it  into  one  or  more  commands  to  underlying  systems, 
executes  those  commands,  and  combines  the  results  from  the  Underlying  Systems. 

The  various  modules  and  systems  that  make  up  the  Front  and  Back  ends  are  changing  as  BBN's  Natural 
Language  technology  develops.  The  following  section  details  a  snapshot  of  the  current  IRUS-II  system.  The  names 
of  these  systems  are  subject  to  change.  In  order  to  install  the  FCCBMP  version  of  IRUS-II,  the  procedure  is  to  load 
a  distnbution  tape  (using  the  Read  Distribution  command  processor  command)  and  then  to  do  Load  System  for 
each  of  the  major  systems  included  in  the  IRUS-II  system,  as  outlined  below. 

A.l  Systems  Included  in  IRIS-II 


IRUS-II  currently  consists  of  the  following  major  systems,  each  of  which  also  may  include  component 
subsystems: 

•  The  systems  CLOS.  BASIC-NEWTsLREME,  CONDITIONS.  NEWKREME.  NEW- WINDOWS  (which 
includes  basic-grapher  and  clustered-windows),  and  WINDOW-EDIT  together  provide  the  KREME 
knowledge  editing  environment,  on  which  the  KNACQ  acquisition  system  is  based. 

•  The  system  IRUS-II  (which  includes  isi-elisp.  isi-mkl.  clisp-records  and  janus-front-end)  represents  the 
core  of  the  natural  language  parser  wuth  support  code 

•  JANUS-KNACQ.  NIKL-COMP  AT  ABILITY,  and  KUI  adjust  the  parser  to  work  with  the  KNACQ 
knowledge  acquisition  system,  and  load  the  domain  model  and  dictionaries. 

•  The  system  EMBEDDED-GRAPHER  provides  for  graphing  parse  trees. 

•  The  system  PARAPHRASER  (which  includes  mumble86.  text-planner-core,  parrot-core,  and  parrot- 
navy-knacqi  adds  the  generation  capabitihes 

•  MUS-FGL-JANUS  (which  includes  basic  back -end  and  mus-fol-basic).  ERL-Ml  S.  and  ERL-INTERP 
make  up  the  back-end  components  which  translate  from  the  internal  meaning  representation  to  a 
modified  first  order  logic  (FOL)  and  direct  the  computation  in  the  multiple  underlying  systems  < MUS i 
to  get  the  answer 

•  The  JANUSKA-I  si-stem  (which  includes  irus-!>  provides  the  frames  and  windows  for  this  v  >n  of 
the  IRUS-U  interlace 
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A. 2  Installation  Instructions 


BBN  distributes  software  in  two  formats: 

•  IDS  files  on  FEP  tapes 

•  Binaries  and/or  sources  on  Distribution  Tapes 

This  section  describes  how  to  load  software  from  each  of  these  distribution  formats. 

A. 2.1  Loading  IDS  Files  From  FEP  Tape(s) 

A  world  made  of  IDS  files  consists  of  one  or  more  "layers”  Each  IDS  file  is  a  "layer"  of  core  image.  In  order 
to  boot  a  world  made  from  IDS  files,  you  must  first  make  sure  all  the  required  IDS  files  and  the  correct  microcode 
are  on  your  machine,  and  that  you  have  a  correct  boot  file. 

BBN  cannot  distnbute  Symbolics  software,  including  operating  system  distribution  loads  and  microcode  files. 
You  must  get  them  from  Symbolics. 

A. 2. 2  Do  You  Ha\e  The  Correct  IDS  Files? 

The  FEP  maintains  a  database  of  IDS  files.  The  following  commands  operate  on  that  database: 

•  Clear  IDS  Files  clears  the  database, 

•  Find  IDS  Files  searches  the  FEP  for  IDS  files,  adding  them  to  the  database 

•  Show  IDS  Files  displays  the  database 

•  Add  IDS  File  adds  an  IDS  file  to  the  database. 

NOTE:  These  IDS  commands  allocate  memory  within  the  FEP  as  they  run  This  memory  is  never  deallocated,  so 
the  FEP  runs  out  of  memory  on  occasion  When  this  happens,  you  have  to  reset  the  FEP. 

To  look  at  your  IDS  files,  use  the  command  processor  command  Show  Fep  Directory  :type  World.  This  will 
display  all  the  IDS  files  on  the  FEP.  indented  as  to  generation.  (See  the  Symbolics  manual  for  interpretation  of  the 
display.)  In  order  to  boot  a  world  containing  an  IDS  file,  you  need  all  the  ancestuis  of  that  IDS  file. 

It  you  are  missing  ancestors,  either  you  didn't  read  all  the  files  from  the  FEP  tapes  BBN  sent  you,  or  you  have 
to  get  operating  sy  stem  or  microcode  files  from  Symbolics. 
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A. 2.3  Resetting  the  FF.P 

When  the  FEP  mns  out  of  memory.  use  the  command  Reset  Fep.  This  will  spin  down  the  disks  and  clear  the 
FEP's  memory.  If  tiie  video  image  on  your  console  loses  sync  (i.e.  you  can’t  read  it  and  you  get  nothing  but  snow 
or  moving  bands  of  black),  that  means  you  have  the  wrong  monitor  type  setting.  There  are  two  types  of  monitors,  so 
try  resetting  the  monitor  type  with  the  following  commands: 

•  Set  M  M 

•  Set  M  P 

One  of  these  should  work.  After  resetting  the  FEP  and  setting  your  monitor  type,  use  the  Hello  command  to 
initialize  the  FEP  and  spin  up  the  disk. 

A. 2.4  Do  You  Hate  The  Correct  Microcode? 

The  correct  microcode  file  for  a  band  has  the  correct  version  number  for  the  band  you  are  booting,  and  the 
correct  file  name  for  the  hardw  are  configuration  you  are  running  on.  To  determine  the  correct  file  name,  look  at  a 
boot  file  on  your  machine  that  boots  successfully .  When  you  boot  a  world,  the  FEP  prints  a  message  containing  the 
loaded  microcode  version  number  and  the  version  number  expected  by  the  band  being  booted.  NOTE.  It  is  posSiole 
to  load  the  right  microcode  version  number  for  the  wrong  hardware  configuration.  The  world  might  successfully 
boot  and  later  die  a  mysterious  death.  For  further  information  on  microcodes,  see  the  Sy  mbolics  Manuals. 

A. 2.5  Reading  FEP  Tapes 

The  IRUS-Il  system  requires  a  Symbolics  machine  with  Genera  7  2  already  installed  and  with  sufficient 
blocks  free  in  its  FEP.  The  number  o!  blocks  required  for  each  system  is  w  ritten  on  the  tape  containing  that  system 
If  your  machine  has  no  tape  drive,  you  will  have  to  read  the  tape  on  another  machine  that  does  have  one  and  then 
transmit  the  bands  to  your  machine.  We  will  use  the  terms  destination  machine  and  tape  drive  machine  to  refer  to 
these  two  machines. 

The  FEP  Tape  program  is  used  to  read  FEP  tapes  I'se  the  command  Select  Activity  FFP-TAPE  to  select 
the  FEP  Tape  program. 

Once  the  FEP-Tape  screen  appears,  you  .ire  ready  to  read  the  tape.  If  the  destinauon  machine  and  the  tape 
drive  machine  are  the  same  (the  machine  you  are  currently  using  has  a  tape  dnvei.  then  you  just  need  to  execute  the 
following  command: 

Read  Tape  :Full  Length  Tapes  Yes 

If  the  destinanon  machine  and  the  tape  drive  machine  are  different,  then  execute  the  follow  . ng  command 

Read  Tape  :Full  Length  Tapes  Yes  :Host  tape  host  maehtne 

For  each  tile  on  the  tape,  the  Read  Tape  command  prompts  with  the  name  of  the  file,  allowing  you  to  specify 
either  where  to  put  (lie  tile  or  to  skip  reading  the  file.  Just  use  the  default  name  that  you  are  presented  with 


75 


BBN  Systems  and  Technologies  Corporation 


Report  No.  7144 


For  more  information  on  the  Read  Tape  command  for  the  FEP-Tape  facility.,  see  the  Site  Operations  (vol  Ui 
manual  of  the  Symbolics  documentation  set 


A. 2.6  Editing  The  Boot  Files 

Now  you  must  create  a  file  inis-ii.boot.  Boot  files  have  machine  specif  information  in  them,  including  the 
hardware  configuration  (in  the  name  of  the  microcode  filet  and  the  network  address  of  the  machine.  Boot  files 
copied  from  another  machine  must  be  edited  before  use!  Whenever  you  do  this,  you'll  at  least  have  to  change  the 
network  address,  and  perhaps  the  microcode  file,  if  it's  a  different  machine  configuration. 

A. 2. 7  Booting  the  BBN  NL  System 

Type  the  following  to  the  FEP 

Boot  irus-ii . boot 

Because  the  band  is  being  booted  at  a  site  other  than  the  site  at  which  it  was  built,  the  machine  will  ask  you  it 
the  site  is  still  BBN.  Answer  NO  and  the  machine  will  name  itself  DIS-LOC  AL-HOST 

If  the  machine  has  identity  problems  (thinks  it  is  still  at  BBN  ;.  the  simplest  way  to  deal  with  them  is  to  unplug 
the  e  the  met  before  booting.  See  your  local  system  wizard  it  you  want  a  more  elegant  solution 

If  you  want  your  machine  to  regain  its  correct  identity,  you  Can  use  the  Set  Site  command  to  tell  it  that  it  is  at 
your  sue.  instead  of  M  BbN.  You  need  to  know  the  chaos  address  of  your  name-pace  serve'  machine.  See  the 
Symbolics  documentation  about  this  command 


A. 3  Loading  (  urr\  Tapes 


In  some  cases,  you  may  receive  a  lap.  with  demonstration  files  cr  other  ind'vidual  tiles  on  it.  These  are 
usually  sent  on  a  tape  format  c  'lieu  a  (  arr\  I  apt 

To  load  a  cam  -tape,  u-.e  the  command 

(tape  :  cany-lcad) 

You  will  have  to  cht«  se  a  place  on  -.our  in.rehine  not  >i.  ,  i  tiles  loaded  from  a  ..am -tape 
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A. 4  Transferring  FF.P  Files  Onto  Other  Machines 


It  you  want  to  have  the  IRUS-Ii  software  on  more  than  one  machine,  you  can  use  the  Copy  World  command 
processor  command  to  copy  the  FEr  wtulu  ides  onto  other  machines.  Yuu  can  als^/  u.-w.  Jic  unsi.m-oaiiu  juJ 
si:receive-band  functions 

For  instance,  if  you  want  to  transfer  FEP  file  from  the  machine  you  are  currently  logged  into  to  another 
machine  you  can  evaluate  the  following 

(si  :  transmit -band  "fepO  :  >file-name  .  load"  destination-machine) 

After  you  copy  the  FEP  tiles,  you  need  to  make  sure  your  destination  machine  has  an  irus-ii.boot  file. 


A. 5  Loading  a  Distribution  Tape 


To  load  a  distribution  tape,  use  the  Read  Distribution  command  processor  command  When  loading 
distribution  tapes,  it  is  not  necessary  that  the  target  machine  have  a  tape  drive.  The  load-distribution-tape  command 
asks  what  machine  the  distribution  tape  is  in.  and  reads  the  tape  across  the  network. 

Once  the  files  have  been  loaded  onto  the  destination  machine,  use  the  Load  System  command  The  correct 
form  of  Load  System  will  be  listed  on  the  tape  label 
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